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FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein 
1 5 expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
breast cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of breast cancer. The invention further relates to methods for 
identifying and using agents and/or targets that inhibit breast cancer. 

20 BACKGROUND OF THE INVENTION 

Breast cancer is one of the most frequently diagnosed cancers and the second 
leading cause of female cancer death in North America and northern Europe, with lung 
cancer being the leading cause. Lifetime incidence of the disease in the United States is one- 
in-eight, with a l-in-29 lifetime risk of dying from breast cancer. Early detection of breast 

25 cancer, using mammography, clinical breast examination, and self breast examination, has 

dramatically improved the treatment of the disease, although sensitivity is still major concern, 
as mammography sensitivity has been estimated at only 60%-90%. Treatment of breast 
cancer consists largely of surgical lumpectomy or mastectomy, radiation therapy, anti- 
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hormone therapy, and/or chemotherapy. Although many breast cancer patients are effectively 
treated, the current therapies can all induce serious side effects which diminish quality of life. 
Deciding on a particular course of treatment is typically based on a variety of prognostic 
parameters and markers (Fitzgibbons et al, 2000, Arch. Pathol. Lab. Med. 124:966-978; 
5 Hamilton and Piccart, 2000, Ann. Oncol. 1 1 ;647-663), including genetic predispostion 
markers BRCA-1 and BRCA-2 (Robson, 2000, J. Clin. OncoL 18:113sup-118sup). 

Imaging of breast cancer for diagnosis has been problematic and limited. In 
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an 
important prognostic factor; five year survival rates drop from 80 percent in patients with no 

10 lymph node metastases to 45 to 50 percent in those patients who do have lymph node 

metastases. A recent report showed that micrometastases can be detected from lymph nodes 
using reverse transcriptase-PCR methods based on the presence of mRNA for 
carcinoembryonic antigen, which has previously been shown to be present in the vast 
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med. 

15 339(4):223 (1998). 

The identification of novel therapeutic targets and diagnostic markers is 
essential for improving the current treatment of breast cancer patients. Recent advances in 
molecular medicine have increased the interest in tumor-specific cell surface antigens that 
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens 

20 suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and 
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell 
antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer 

25 (Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal 

antibodies (Rituxin®/rituximab) are used to effectively treat non-Hodgekin's lymphoma 
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol. 
10:548-551). 

Other potential immunotherapeutic targets have been identified for breast 
30 cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane 
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protein, present at the apical surface of glandular epithelial cells. It is often overexpressed in 
breast cancer, and typically exhibits an altered glycosylation pattern, resulting in an 
antigenically distinct molecule, and is in early clinical trials as a vaccine target (Gilewski et 
al., 2000, Clin. Cancer Res. 6:1693-1701; Scholl et al., 2000, J. Immunother. 23:570-580). 
The tumor-expressed protein is often cleaved into the circulation, where it is detectable as the 
tumor marker, CA 15-3 (Bon et al., 1997, Clin. Chem. 43:585-593). However, many patients 
have tumors that express neither HER2 nor MUC-1 ; therefore, it is clear that other targets 
need to be identified to manage localized and metastatic disease. Many other genes have been 
reported to be overexpressed in breast cancer, such as EGFR (Sainsbury et al, 1987, Lancet 
1(8547):1 398-1402), c-erbB3 (Naidu et al, 1988, Br. J. Cancer 78:1385-1390), FGFR2 
(Penault-Llorca et ah, 1991, Int. J. Cancer 61:170-176), PKW (Preiherr et al, 2000, 
Anticancer Res. 20:2255-2264), MTA1 (Nawa et al., 2000, J. Cell Biochem. 79:202-212), 
breast cancer associated gene 1 (Kurt et al., 2000, Breast Cancer Res. Treat. 59:41-48). 
Although monoclonal antibodies to the protein products of some of these overexpressed 
genes have been reported (for review, see Green et al., 2000, Cancer Treat. Rev. 26:269-286), 
none are currently approved for breast cancer therapy in the US. 

Disclosures of certain genes and ESTs described as being expressed in breast 
cancer are found in international patent applications WO-99/33869, WO-97/25426, WO- 
97/02280 and WO-OO/55173, WO-98/45328 and WO-00/22130. Similarly, genes and ESTs 
described as being expressed in breast cancer are disclosed in US Patent Nos. 5,759,776 and 
5/693,522. The utility of such genes is described in each of these publications, and their 
disclosures are incorporated herein in their entirety. 

While industry and academia have identified novel sequences, there has not 
been an equal effort exerted to identify the function of these novel sequences. The 
elucidation of a role for novel proteins and compounds in disease states for identification of 
therapeutic targets and diagnostic markers is essential for improving the current treatment of 
breast cancer patients. Accordingly, provided herein are molecular targets for therapeutic 
intervention in breast and other cancers. Additionally, provided herein are methods that can 
be used in diagnosis and prognosis of breast cancer. Further provided are methods that can 
be used to screen candidate bioactive agents for the ability to modulate breast cancer. 

3 
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SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that 
are up- and down-regulated in breast cancer cells. Such genes are useful for diagnostic 
5 purposes, and also as targets for screening for therapeutic compounds that modulate breast 
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a breast 
cancer- associated transcript in a cell from a patient, the method comprising contacting a 
10 biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of a breast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
15 breast cancer-associated transcript in a cell from a patient, the method comprising contacting 
a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 
20 In one embodiment, the biological sample is a tissue sample. In another 

embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 
25 In one embodiment, the patient is undergoing a therapeutic regimen to treat 

breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA. 
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In one embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of breast cancer, the method comprising the steps of: (i) 
5 providing a biological sample from a patient undergoing the therapeutic treatment; and (ii) 
determining the level of a breast cancer-associated transcript in the biological sample by 
contacting the biological sample with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25, thereby monitoring 
the efficacy of the therapy. In a further embodiment, the patient has metastatic breast cancer. 
10 In a further embodiment, the patient has a drug resistant form of breast cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the breast cancer-associated transcript to a level of the breast cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

15 Additionally, provided herein is a method of evaluating the effect of a 

candidate breast cancer drug comprising administering the drug to a patient and removing a 
cell sample from the patient. The expression profile of the cell is then determined. This 
method may further comprise comparing the expression profile to an expression profile of a 
healthy individual. In a preferred embodiment, said expression profile includes a gene of 

20 Tables 1-25. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-25. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

acid. 

25 In one aspect, the present invention provides an isolated polypeptide which is 

encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-25. 

In another aspect, the present invention provides an antibody that specifically 
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1-25. 
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In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 
5 In one aspect, the present invention provides a method of detecting a breast 

cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
1 0 biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the 
1 5 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect. 
20 In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 

cell membrane. In another embodiment, the polypeptide is recombinant. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
25 proliferation of a breast cancer- associated cell to treat breast cancer in a patient, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
compound identified as described herein. 

In one embodiment, the compound is an antibody. 
In another aspect, the present invention provides a drug screening assay 
30 comprising the steps of: (i) administering a test compound to a mammal having breast cancer 

6 
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or to a cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-25 in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
5 the level of expression of the polynucleotide is a candidate for the treatment of breast cancer. 

In one embodiment, the control is a mammal with breast cancer or a cell 
sample therefrom that has not been treated with the test compound. In another embodiment, 
the control is a normal cell or mammal. 

Tn one embodiment, the test compound is administered in varying amounts or 
10 concentrations, In another embodiment, the test compound is administered for varying time 
periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are 
15 individually compared to their respective levels in a control cell sample or mammal. In a 
preferred embodiment the plurality of polynucleotides is from three to ten. 

In another aspect, the present invention provides a method for treating a 
mammal having breast cancer comprising administering a compound identified by the assay 
described herein. 

20 In another aspect, the present invention provides a pharmaceutical 

composition for treating a mammal having breast cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

In one aspect, the present invention provides a method of screening drug 
25 candidates by providing a cell expressing a gene that is up- and down-regulated as in a breast 
cancer. In one embodiment, a gene is selected from Tables 1-25. The method further 
includes adding a drug candidate to the cell and determining the effect of the drug candidate 
on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
30 comparing the level of expression in the absence of the drug candidate to the level of 
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expression in the presence of the drug candidate, wherein the concentration of the drug 
candidate can vary when present, and wherein the comparison can occur after addition or 
removal of the drug candidate. In a preferred embodiment, the cell expresses at least two 
expression profile genes. The profile genes may show an increase or decrease. 
5 Also provided is a method of evaluating the effect of a candidate breast cancer 

drug comprising administering the drug to a transgenic animal expressing or over-expressing 
the breast cancer modulatory protein, or an animal lacking the breast cancer modulatory 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid 

1 0 segments of Tables 1-25, wherein the biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with breast cancer 
is provided. The method comprises determining the expression of a gene of Tables 1-25, 

1 5 preferably a gene of Table 25, in a first tissue type of a first individual, and comparing the 
distribution to the expression of the gene from a second normal tissue type from the first 
individual or a second unaffected individual. A difference in the expression indicates that the 
first individual has a disorder associated with breast cancer. 

In a further embodiment, the biochip also includes a polynucleotide sequence 

20 of a gene that is not up- and down-regulated in breast cancer. 

In one embodiment a method for screening for a bioactive agent capable of 
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory 
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory 
protein or fragment thereof. In a preferred embodiment, the method comprises combining a 

25 breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an 
antibody which binds to said breast cancer modulatory protein or fragment thereof. The 
method further includes determining the binding of said breast cancer modulatory protein or 
fragment thereof and said antibody. Wherein there is a change in binding, an agent is 
identified as an interfering agent. The interfering agent can be an agonist or an antagonist. 

30 Preferably, the agent inhibits breast cancer. 
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Also provided herein are methods of eliciting an immune response in an 
individual. In one embodiment a method provided herein comprises administering to an 
individual a composition comprising a breast cancer modulating protein, or a fragment 
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those 
5 of Tables 1-25. 

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more 
preferably of Table 25, or a fragment thereof, and a pharmaceutically acceptable carrier. In 

10 another embodiment, said composition comprises a nucleic acid comprising a sequence 
encoding a breast cancer modulating protein, preferably selected from the nucleic acids of 
Tables 1-25, and a pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a breast cancer protein, 
or a fragment thereof, comprising contacting an agent specific for said protein with said 

1 5 protein in an amount sufficient to effect neutralization. In another embodiment, the protein is 
encoded by a nucleic acid selected from those of Tables 1-25. 

In another aspect of the invention, a method of treating an individual for breast 
cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the 

20 method comprises administering to a patient having breast cancer an antibody to a breast 

cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 
be a cytotoxic agent or a radioisotope. 



DETAILED DESCRIPTION OF THE INVENTION 
25 In accordance with the objects outlined above, the present invention provides 

novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including 
metastatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 

Tables 1-24B provide unigene cluster identification numbers for the 
30 nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer 

9 
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samples. Tables 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13,18, 19, 20, 21, and 22 list those genes that are 
up-regulated in breast cancer cells. Table 14 lists those genes that are highly upregulated in 
breast cancer cells. Table 1, 2, 3, 15, and 23 list genes that are down-regulated in breast 
cancer cells and Table 16, lists genes that are highly down-regulated in breast cancer genes. 
5 The Tables also provide an exemplar accession number that provides a nucleotide sequence 
that is part of the unigene cluster. 

Definitions 

The term "breast cancer protein" or ''breast cancer polynucleotide" or "breast 

10 cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, 
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has 
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, 
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 
sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 

15 500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 

Tables 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen 
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a 
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybridize 
under stringent hybridization conditions to a nucleic acid sequence, or the complement 

20 thereof of Tables 1-25 and conservatively modified variants thereof or (4) have an amino acid 
sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%, 
80%, 85%, 90%, preferably 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater 
amino sequence identity, preferably over a region of over a region of at least about 25, 50, 
100, 200, 500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide 

25 sequence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide 
sequence is typically from a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer 
polypeptide" and a "breast cancer polynucleotide," include both naturally occurring or 
recombinant forms. 
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A "full length" breast cancer protein or nucleic acid refers to a breast cancer 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type breast cancer 
polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various 
5 stages of post-translation processing or splicing, including alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a breast cancer protein, polynucleotide or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 

10 tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also 
include explants and primary and/or transformed cell cultures derived from patient tissues. A 
biological sample is typically obtained from a eukaryotic organism, most preferably a 
mammal such as a primate e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea 

1 5 pig, rat, mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 

20 methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 

25 same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 

30 web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 
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be "substantially identical." This definition also refers to, or may be applied to, the 
compliment of a test sequence. The definition also includes sequences that have deletions 
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
5 algorithms can account for gaps and the like. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 

10 and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

15 A "comparison window", as used herein, includes reference to a segment of 

one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 

20 for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, /. Mol 
Biol 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Natl. 
Acad. Set USA 85:2444 (1988), by computerized implementations of these algorithms 

25 (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection {see, e.g., Current Protocols in Molecidar Biology (Ausubel et aL, eds. 
1995 supplement)). 

Preferred examples of algorithms that are suitable for determining percent 

30 sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 
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which are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et 
al, J. Mol Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins of 
the invention. Software for performing BLAST analyses is publicly available through the 
5 National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 

algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length \V in the query sequence, which either match or satisfy some positive- valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul et al, supra). These initial 

10 neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 

15 sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 

20 sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M^5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 1 0, and the BLOSUM62 scoring matrix 
{see Henikoff & Henikoff, Proa Natl Acad. Set USA 89:10915 (1989)) alignments (B) of 

25 50, expectation (E) of 10, M^S, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Natl. Acad. Sci. USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 

30 between two nucleotide or amino acid sequences would occur by chance. For example, a 
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nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001 . Log values may 
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 
5 An indication that two nucleic acid sequences or polypeptides are substantially 

identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 

10 indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 

15 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

20 The terms "isolated," "purified," or "biologically pure" refer to material that is 

substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 

25 preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 

30 most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
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removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure. 

The terms * 'polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
5 in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 

10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 

1 5 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 

modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring amino acid. 

20 Amino acids may be referred to herein by either their commonly known three 

letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 

25 acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 

30 most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino 
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acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
5 polypeptide also describes silent variations of the nucleic acid. One of skill will recognize 
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the 
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can 
be modified to yield a functionally identical molecule. Accordingly, often silent variations of 
a nucleic acid which encodes a polypeptide is implicit in a described sequence with respect to 

10 the expression product, but not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 

15 results in the substitution of an amino acid with a chemically similar amino acid. 

Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention.typically 
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), 

20 Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) 

Isoleucine (1), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine ( Y), 
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, 
e.g., Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 

25 terms of various levels of organization. For a general discussion of this organization, see, 
e.g., Alberts et al 9 Molecular Biology of the Cell (3 rd ed., 1994) and Cantor & Schimmel, 
Biophysical Chemistiy Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 

30 structures are commonly known as domains. Domains are portions of a polypeptide that 
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often form a compact unit of the polypeptide and are typically 25 to approximately 500 
amino acids long. Typical domains are made up of sections of lesser organization such as 
stretches of P-sheet and a -helices. "Tertiary structure" refers to the complete three 
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three 
5 dimensional structure formed, usually by the noncovalent association of independent tertiary 
units. Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical 
equivalents used herein means at least two nucleotides covalently linked together. 
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more 

10 nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and 

polynucleotides are a polymers of any length, including longer lengths, e.g., 200, 300, 500, 
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have alternate backbones, comprising, e.g., phosphoramidate, 

1 5 phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 
positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 

20 Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & 

Cook, eds.. Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 

25 nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, including, for 
example, phosphoramidate (Beaucage et al, Tetrahedron 49(10):1925 (1993) and references 
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579 

30 (1977); Letsinger et al., Nucl Acids Res. 14:3487 (1986); Sawai et al, Chem. Lett. 805 
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(1984), Letsinger et al., J. Am. Chem. Soc. 110:4470 (1988); and Pauwels et al, Chemica 
Scripta 26:141 91986)), phosphorothioate (Mag et al, Nucleic Acids Res. 19:1437 (1991); 
and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al., J. Am. Chem. Soc. 111:2321 
(1989), O-methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: 
5 A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and 

linkages (see Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al, Chem. Int. Ed. Engl. 
31:1008 (1992); Nielsen, Nature, 365:566 (1993); Carlsson et al. Nature 380:207 (1996), all 
of which are incorporated by reference). Other analog nucleic acids include those with 
positive backbones (Denpcy et al, Proc. Natl. Acad. Sci. USA 92:6097 (1995); non-ionic 

10 backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; 
Kiedrowshi et al, Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al, J. Am. 
Chem. Soc. 110:4470 (1988); Letsinger et al. Nucleoside & Nucleotide 13:1597 (1994); 
Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense 
Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et al, Bioorganic & Medicinal 

15 Chem. Lett. 4:395 (1994); Jeffs et al, I Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 
37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate 
Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids 
containing one or more carbocyclic sugars are also included within one definition of nucleic 

20 acids (see Jenkins et al, Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acid analogs 
are described in Rawls, C & E News June 2, 1997 page 35. All of these references are hereby 
expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 

25 contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (T m ) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 

30 due to their non-ionic nature, hybridization of the bases attached to these backbones is 
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relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 

5 by those in the art, the depiction of a single strand also defines the sequence of the 

complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 

10 xanthine hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a 

naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g. the individual units of a peptide nucleic 

15 acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes {e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 

20 or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be 
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position. 
Any method known in the art for conjugating the antibody to the label may be employed, 
including those methods described by Hunter et al., Nature , 144:945 (1962); David et al., 

25 Biochemistry, 13:1014 (1974); Pain et al, J. Immunol. Meth., 40:219 (1981); and Nygren, L 
Histochem. and Cvtochem. , 30:407 (1982). 

An "effector" or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 

30 The "effector" can be a variety of molecules including, e.g., detection moieties including 
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radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
5 covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

10 As used herein a "nucleic acid probe or oligonucleotide" is defined as a 

nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 

1 5 may be joined by a linkage other than a phosphodiester bond, so long as it does not 

functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in 
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. 
It will be understood by one of skill in the art that probes may bind target sequences lacking 
complete complementarity with the probe sequence depending upon the stringency of the 

20 hybridization conditions. The probes are preferably directly labeled as with isotopes, 

chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a 
streptavidin complex may later bind. By assaying for the presence or absence of the probe, 
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or 
prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

25 The term "recombinant" when used with reference, e.g., to a cell, or nucleic 

acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., 
recombinant cells express genes that are not found within the native (non-recombinant) form 

30 of the cell or express native genes that are otherwise abnormally expressed, under expressed 
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or not expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinant^, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g. , 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 
acid sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
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promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
5 synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 

10 duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 

stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of 

15 nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Techniques in Biochemistiy and Molecular Biology-Hybridization with Nucleic 
Probes , "Overview of principles of hybridization and the strategy of nucleic acid assays" 

20 (1993). Generally, stringent conditions are selected to be about 5-10°C lower than the 

thermal melting point (T m ) for the specific sequence at a defined ionic strength pH. The T m is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 
of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at T m , 50% of the probes are occupied at 

25 equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about TOM sodium ion, typically about 0.01 to 1 .0 M sodium ion concentration (or other 
salts) at pH 7.0 to 83 and the temperature is at least about 30°C for short probes (e.g., 10 to 
50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 

30 formamide. For selective or specific hybridization, a positive signal is at least two times 
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background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency 
5 amplification, although annealing temperatures may vary between about 32°C and 48°C 
depending on primer length. For high stringency PCR amplification, a temperature of about 
62°C is typical, although high stringency annealing temperatures can range from about 50°C 
to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for 

10 30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
reactions are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 

1 5 still substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 

20 1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 
background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al. 

25 The phrase "functional effects" in the context of assays for testing compounds 

that modulate activity of a breast cancer protein includes the determination of a parameter 
that is indirectly or directly under the influence of the breast cancer protein or nucleic acid, 
e.g., a functional, physical, or chemical effect, such as the ability to decrease breast cancer. It 
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact 
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inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing 
metastasis, and other characteristics of breast cancer cells. "Functional effects" include in 
5 vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a breast 
cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. Such 
functional effects can be measured by any means known to those skilled in the art, e.g., 

10 changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the breast cancer protein; 
measuring binding activity or binding assays, e.g. binding to antibodies or other ligands, and 
measuring cellular proliferation. Determination of the functional effect of a compound on 

1 5 breast cancer can also be performed using breast cancer assays known to those of skill in the 
art such as an in vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing 

20 metastasis, and other characteristics of breast cancer cells. The functional effects can be 

evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative 
or qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for breast cancer-associated sequences, measurement of RNA stability, 
identification of downstream or reporter gene expression (CAT, luciferase, p-gal, GFP and 

25 the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody binding, 
inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of breast cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 
or compounds identified using in vitro and in vivo assays of breast cancer polynucleotide and 

30 polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
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activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of breast cancer proteins, e.g., antagonists. Antisense nucleic acids 
may seem to inhibit expression and subsequent function of the protein. "Activators" are 
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 

5 up regulate breast cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of breast cancer proteins, e.g., versions with altered activity, as 
well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 
chemical molecules and the like. Such assays for inhibitors and activators include, e.g., 
expressing the breast cancer protein in vitro, in cells, or cell membranes, applying putative 

10 modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of breast cancer can also be identified by incubating breast 
cancer cells with the test compound and determining increases or decreases in the expression 
of 1 or more breast cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more breast 
cancer proteins, such as breast cancer proteins encoded by the sequences set out in Tables 1- 

15 25. 

Samples or assays comprising breast cancer proteins that are treated with a 
potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples 
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 

20 of a polypeptide is achieved when the activity value relative to the control is about 80%, 
preferably 50%, more preferably 25-0%. Activation of a breast cancer polypeptide is 
achieved when the activity value relative to the control (untreated with activators) is 110%, 
more preferably 1 50%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000% higher. 

25 The phrase "changes in cell growth" refers to any change in cell growth and 

proliferation characteristics in vitro or in vivo, such as formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 

30 ability to form or suppress tumors when injected into suitable animal hosts, and/or 
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immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells a Manual of Basic 
Technique pp. 231-241 (3 rd ed. 1994). 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," 'transformed" cells or "transformation" in tissue culture, refers 
5 to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of 
new genetic material. Although transformation can arise from infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 

10 aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney, 
Culture of Animal Cells a Manual of Basic Technique (3 rd ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 

15 epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 

20 Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 

25 responsible for antigen recognition. The terms variable light chain (V L ) and variable heavy 
chain (V H ) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)' 2 , a 

30 dimer of Fab which itself is a light chain joined to V H -C H 1 by a disulfide bond. The F(ab)' 2 
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may be reduced under mild conditions to break the disulfide linkage in the hinge region, 
thereby converting the F(ab)' 2 dimer into an Fab' monomer. The Fab' monomer is 
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 

5 antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 
herein, also includes antibody fragments either produced by the modification of whole 
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et al, Nature 

10 348:552-554(1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used (see, e.g., Kohler & Milstein, 
Nature 256:495-497 (1975); Kozbor et al 9 Immunology Today 4:72 (1983); Cole et ai, pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in 

1 5 Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding, 
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the 
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce 
antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such 
as other mammals, may be used to express humanized antibodies. Alternatively, phage 

20 display technology can be used to identify antibodies and heteromeric Fab fragments that 
specifically bind to selected antigens (see, e.g., McCafferty et ai, Nature 348:552-554 
(1990); Marks et al 9 Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 

25 ( variable region) is linked to a constant region of a different or altered class, effector function 
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

30 
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Identification of breast cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
5 sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 
characteristic of the state of the cell. That is, normal tissue (e.g., normal breast or other 
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the breast, or 
breast cancer tissue or metastatic breast cancerous tissue can be compared with tissue 
10 samples of breast and other tissues from surviving cancer patients. By comparing expression 
profiles of tissue in known different breast cancer states, information regarding which genes 
are important (including both up- and down-regulation of genes) in each of these states is 
obtained. 

The identification of sequences that are differentially expressed in breast 
15 cancer versus non-breast cancer tissue allows the use of this information in a number of ways. 
For example, a particular treatment regime may be evaluated: does a chemotherapeutic drug 
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular 
patient. Similarly, diagnosis and treatment outcomes may be done or confirmed by 
comparing patient samples with the known expression profiles. Metastatic tissue can also be 
20 analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene 

expression profiles (or individual genes) allow screening of drug candidates with an eye to 
mimicking or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the breast cancer expression profile. This may be done by making biochips 
comprising sets of the important breast cancer genes, which can then be used in these screens. 
25 These methods can also be done on the protein basis; that is, protein expression levels of the 
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents. 
In addition, the breast cancer nucleic acid sequences can be administered for gene therapy 
purposes, including the administration of antisense nucleic acids, or the breast cancer proteins 
(including antibodies and other modulators thereof) administered as therapeutic drugs. 
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Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in breast cancer, herein termed "breast cancer sequences." As 
outlined below, breast cancer sequences include those that are up-regulated (i.e., expressed at 
a higher level) in breast cancer, as well as those that are down-regulated (i.e., expressed at a 

5 lower level). In a preferred embodiment, the breast cancer sequences are from humans; 
however, as will be appreciated by those in the art, breast cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other breast 
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 

10 horses, etc.) and pets, e.g., (dogs, cats, etc.). Breast cancer sequences from other organisms 
may be obtained using the techniques outlined below. 

Breast cancer sequences can include both nucleic acid and amino acid 
sequences. As will be appreciated by those in the art and is more fully outlined below, breast 
cancer nucleic acid sequences are useful in a variety of applications, including diagnostic 

1 5 app lications, which will detect naturally occurring nucleic acids, as well as screening 

applications; e.g., biochips comprising nucleic acid probes or PCR microliter plates with 
selected probes to the breast cancer sequences can be generated. 

A breast cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such 

20 homology can be based upon the overall nucleic acid or amino acid sequence, and is 

generally determined as outlined below, using either homology programs or hybridization 
conditions. 

For identifying breast cancer-associated sequences, the breast cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal and cancerous 
25 tissues, or tumor tissue samples from patients who have metastatic disease vs. non metastatic 
tissue. Other suitable tissue comparisons include comparing breast cancer samples with 
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers, 
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, drug resistant 
states, and tissue undergoing metastasis, are applied to biochips comprising nucleic acid 
30 probes. The samples are first microdissected, if applicable, and treated as is known in the art 
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for the preparation of mRNA. Suitable biochips are commercially available, e.g. from 
Affymetrix. Gene expression profiles as described herein are generated and the data 
analyzed. 

In one embodiment, the genes showing changes in expression as between 
5 normal and disease states are compared to genes expressed in other normal tissues, preferably 
normal breast, but also including, and not limited to lung, heart, brain, liver, breast, kidney, 
muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a preferred 
embodiment, those genes identified during the breast cancer screen that are expressed in any 
significant amount in other tissues are removed from the profile, although in some 

10 embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable 
that the target be disease specific, to minimize possible side effects. 

In a preferred embodiment, breast cancer sequences are those that are up- 
regulated in breast cancer; that is, the expression of these genes is higher in the breast cancer 
tissue as compared to non-cancerous tissue. "Up-regulation" as used herein often means at 

1 5 least about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. All unigene cluster identification numbers and accession 
numbers herein are for the GenBank sequence database and the sequences of the accession 
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see, 
e.g., Benson, DA, et al, Nucleic Acids Research 26: 1-7 (1998) and 

20 http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 

European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
U.S. Patent Application N. 09/687,576, with the same assignee as the present application, 
further discloses related sequences, compositions, and methods of diagnosis and treatment of 
breast cancer is hereby expressly incorporated by reference. 

25 In another preferred embodiment, breast cancer sequences are those that are 

down-regulated in the breast cancer; that is, the expression of these genes is lower in breast 
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1,2, 3, 15, 16 etc.). 
"Down-regulation" as used herein often means at least about a two-fold change, preferably at 
least about a three fold change, with at least about five-fold or higher being preferred. 
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Informatics 

The ability to identify genes that are over or under expressed in breast cancer 
can additionally provide high-resolution, high-sensitivity datasets which can be used in the 
areas of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 
biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with breast cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation (see Anderson, Pharmaceutical Proteomics: Targets, Mechanism, 
and Function, paper presented at the IBC Proteomics conference, Coronado, CA (June 1 1-12, 
1998)). Subcellular toxicological information can also be utilized in a biological sensor 
device to predict the likely toxicological effect of chemical exposures and likely tolerable 
exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue from 
datasets relevant to other biomolecules andbioactive agents (e.g., nucleic acids, saccharides, 

lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format. The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic 
database. The electronic database of the invention can be maintained on any electronic 
device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing breast cancer, i.e., the identification of breast cancer-associated 
sequences described herein, provide an abundance of information, which can be correlated 
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
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gene-disease causal linkages, identification of correlates of immunity and physiological 
status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 
5 An array of methods for indexing and retrieving biomolecular information is 

known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 
database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 

10 containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 
for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 

15 between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 

using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 

20 described as on-line analytical processing (OLAP), which entails the consolidation of 

projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 

25 the merger of two or more such tree structures. 

See also Mount et aL, Bioinformatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin et aL, eds., 1 999); 
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis & 
Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological 

30 Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 
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aL, eds 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist s Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and 
5 Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995). 

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 
10 In an exemplary embodiment, at least one of the sources of target-containing 

sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for breast cancer. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 
1 5 target species in a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 

present in the sample. 

The invention also provides for the storage and retrieval of a collection of 

20 target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 

25 transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 
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When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
5 comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 

10 compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 

(e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 

15 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 

20 cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 

25 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 

30 initialized to load and execute the computer program for alignment and/or comparison of the 
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assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 

5 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 

10 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 

1 5 an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the invention, 

20 which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

Characteristics of breast cancer-associated proteins 

25 Breast cancer proteins of the present invention may be classified as secreted 

proteins, transmembrane proteins or intracellular proteins. In one embodiment, the breast 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaling pathways); aberrant expression of such 

30 proteins often results in unregulated or disregulated cellular processes (see, e.g., Molecular 
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Biology of the Cell (Alberts, ed., 3rd ed., 1994). For example, many intracellular proteins 
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease 
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins 
also serve as docking proteins that are involved in organizing complexes of proteins, or 
5 targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 
in the proteins of one or more motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 

10 conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 
targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 

1 5 few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

20 One useful database is Pfam (protein families), which is a large collection of multiple 

sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden {see, e.g., Bateman et aL, Nua 
Acids Res. 28:263-266 (2000); Sonnhammer et al, Proteins 28:405-420 (1997); Bateman et 

25 al, Nuc. Acids Res. 27:260-262 (1999); and Sonnhammer et aL, Nuc. Acids Res, 26:320-322- 
(1998)). 

In another embodiment, the breast cancer sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
They may have an intracellular domain, an extracellular domain, or both. The intracellular 
30 domains of such proteins may have a number of functions including those already described 
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for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 

containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 
guanylyl cyclases and receptor serine/threonine protein kinases contain a single 
transmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
may be followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/). 
Important transmembrane protein receptors include, but are not limited to the insulin 
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 
transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein 
receptor, epidermal growth factor receptor, leptin receptor, interleukin receptors, e.g. IL-1 

receptor, EL-2 receptor, 

The extracellular domains of transmembrane proteins are diverse; however, 
conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
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cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) 
anchor, or may themselves be transmembrane proteins. Extracellular domains also associate 
5 with the extracellular matrix and contribute to the maintenance of the cell structure. 

Breast cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 

10 situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 
be made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 

15 secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the breast cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous physiological events; by virtue of their circulating nature, 

20 they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 
on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology. Breast cancer proteins that are secreted proteins are particularly 

25 preferred in the present invention as they serve as good targets for diagnostic markers, e.g., 
for blood, plasma, serum, or stool tests. 

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identified by 
30 substantial nucleic acid and/or amino acid sequence homology or linkage to the breast cancer 
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sequences outlined herein. Such homology can be based upon the overall nucleic acid or 
amino acid sequence, and is generally determined as outlined below, using either homology 
programs or hybridization conditions. Typically, linked sequences on a mRNA are found on 
the same molecule. 

5 The breast cancer nucleic acid sequences of the invention, e.g., the sequences 

in Tables 1-25, can be fragments of larger genes, i.e., they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
sequences provided herein, extended sequences, in either direction, of the breast cancer genes 
10 can be obtained, using techniques well known in the art for cloning either longer sequences or 
the foil length sequences; see Ausubel, et al, supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences corresponding to a single 
gene, e.g., systems such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the breast cancer nucleic acid is identified, it can be cloned and, if 
1 5 necessary, its constituent parts recombined to form the entire breast cancer nucleic acid 
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g., 
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid 
segment, the recombinant breast cancer nucleic acid can be farther-used as a probe to identify 
and isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be 
20 used as a "precursor" nucleic acid to make modified or variant breast cancer nucleic acids 
and proteins. 

The breast cancer nucleic acids of the present invention are used in several 
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
25 or for administration, e.g., for gene therapy, vaccine, and/or antisense applications. 

Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer 
proteins can be put into expression vectors for the expression of breast cancer proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids 
30 (both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
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made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the breast cancer nucleic acids, i.e. the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
hybridization of the target sequence and the probes of the present invention occurs. As 
5 outlined below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 

10 complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 

15 composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

20 In a preferred embodiment, more than one probe per sequence is used, with 

either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 
25 As will be appreciated by those in the art, nucleic acids can be attached or 

immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
30 covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 

5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

I o in general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 
1 5 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 
20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. A preferred substrate is described in copending application entitled Reusable Low 
25 Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 1 5, 1999, 
herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 
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sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 
5 the biochip is derivatized with a chemical functional group including, but not limited to,' 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g. using linkers as are known in the art; e.g., 
10 homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company 
catalog, technical section on cross- linkers, pages 1 55-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 
15 and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5* or 3' terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 
20 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 
25 in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of breast cancer-associated sequences. These assays are typically performed in 
30 conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid 
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sequence acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or 
PCR). In a quantitative amplification, the amount of amplification product will be 
proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of breast cancer-associated RNA Methods of 
quantitative amplification are well known to those of skill in the art. Detailed protocols for 
quantitative PCR are provided, e.g., in Innis et aL, PCR Protocols, A Guide to Methods and 

Applications (1990). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5 5 nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3 J 
quenching agent, thereby resulting in an increase in fluorescence as a function of 
amplification {see, e.g., literature provided by Pericin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) {see Wu & Wallace, Genomics 4:560 (1989), Landegren et aL, Science 
241:1077 (1988), and Barringer et ai 9 Gene 89:117 (1990)), transcription amplification 
(Kwoh et aL, Proc. Natl. Acad. Set USA 86:1 173 (1989)), self-sustained sequence replication 
(Guatelli et aL, Proc. Nat. Acad. Set USA 87: 1874 (1990)), dot PCR, and linker adapter PCR, 
etc. 

Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e.g., encoding breast 
cancer proteins are used to make a variety of expression vectors to express breast cancer 
proteins which can then be used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in the art {see, e.g., 
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and are 
used to express proteins. The expression vectors may be either self-replicating 
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 

43 



WO 02/(159377 



PCT/USO2/02242 



expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the breast cancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a 
5 promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 

10 that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 

15 do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
nucleic acid will generally be appropriate to the host cell used to express the breast cancer 
protein. Numerous types of appropriate expression vectors, and suitable regulatory 

20 sequences are known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, 
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 

25 start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 
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In addition, an expression vector may comprise additional elements. For 
example, the expression vector may have two replication systems, thus allowing it to be 
maintained in two organisms, e.g. in mammalian or insect cells for expression and in a 
procaryotic host for cloning and amplification. Furthermore, for integrating expression 
5 vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 
The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). 
I o In addition, in a preferred embodiment, the expression vector contains a 

selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The breast cancer proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding a breast 
15 cancer protein, under the appropriate conditions to induce or cause expression of the breast 
cancer protein. Conditions appropriate for breast cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
20 proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
25 and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the breast cancer proteins are expressed in 
30 mammalian cells. Mammalian expression systems are also known in the art, and include 
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retroviral and adenoviral systems. One expression vector system is a retroviral vector system 
such as is generally described in PCT/US97/01019 and PCT/US97/01048, both of which are 
hereby expressly incorporated by reference. Of particular use as mammalian promoters are 
the promoters from mammalian viral genes, since the viral genes are often highly expressed 
5 and have a broad host range. Examples include the SV40 early promoter, mouse mammary 
tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, 
and the CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription 
termination and polyadenylation sequences recognized by mammalian cells are regulatory 
regions located 3' to the translation stop codon and thus, together with the promoter elements, 

10 flank the coding sequence. Examples of transcription terminator and polyadenlyation signals 
include those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 

15 polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 

encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei. 

In a preferred embodiment, breast cancer proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 

20 bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac 
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 

25 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the breast cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

30 bacterial strains that have been transformed. Suitable selection genes include genes which 
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render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 
5 known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, 
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, breast cancer proteins are produced in insect cells. 
1 0 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, breast cancer protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 
1 5 Kluyveromyces fi'agilis and K. lactis, Pichia giullerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The breast cancer protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the 
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an 
20 immunogen. Alternatively, the breast cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the breast cancer protein is a 
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression purposes. 

In a preferred embodiment, the breast cancer protein is purified or isolated 
25 after expression. Breast cancer proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoreric, molecular, immunological 
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusmg. For example, the breast cancer protein 
30 may be purified using a standard anti-breast cancer protein antibody column. Ultrafiltration 
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and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
general guidance in suitable purification techniques, see Scopes, Protein Purification (1982). 
The degree of purification necessary will vary depending on the use of the breast cancer 
protein. In some instances no purification will be necessary. 
5 Once expressed and purified if necessary, the breast cancer proteins and 

nucleic acids are useful in a number of applications. They may be used as immunoselection 
reagents, as vaccine reagents, as screening agents, etc. 

Variants of breast cancer proteins 

1 0 In one embodiment, the breast cancer proteins are derivative or variant breast 

cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below, 
the derivative breast cancer peptide will often contain at least one amino acid substitution, 
deletion or insertion, with amino acid substitutions being particularly preferred. The amino 
acid substitution, insertion or deletion may occur at any residue within the breast cancer 

15 peptide. 

Also included within one embodiment of breast cancer proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 
three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the breast cancer 

20 protein, using cassette or PCR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant breast cancer protein fragments having up to 
about 100-150 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 

25 a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
breast cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 

30 predetermined, the mutation per se need not be predetermined. For example, in order to 
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optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed breast cancer variants screened for 
the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, e.g., M13 primer 
5 mutagenesis and PCR mutagenesis. Screening of the mutants is done using assays of breast 
cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually 
will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 

10 some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 
arrive at a final derivative. Generally these changes are done on a few amino acids to 
minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the breast cancer protein are 

15 desired, substitutions are generally made in accordance with the amino acid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the breast cancer proteins as needed. Alternatively, 

20 the variant may be designed such that the biological activity of the breast cancer protein is 
altered. For example, glycosylation sites may be altered or removed. 

Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 

25 backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 
the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
The substitutions which in general are expected to produce the greatest changes in the 
polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or threonyl is 
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or 

30 alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue 
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having an electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) 
an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side 
chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine. 

Covalent modifications of breast cancer polypeptides are included within the 
5 scope of this invention. One type of covalent modification includes reacting targeted amino 
acid residues of a breast cancer polypeptide with an organic derivatizing agent that is capable 
of reacting with selected side chains or the N-or C-terminal residues of a breast cancer 
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking 
breast cancer polypeptides to a water-insoluble support matrix or surface for use in the 

10 method for purifying anti-breast cancer polypeptide antibodies or screening assays, as is more 
fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters 
with 4-azidosalicylic acid, homobi functional imidoesters, including disuccinimidyl esters 
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 

1 5 maleimido-1 ,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate. . 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton, 

20 Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the N- 
terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the breast cancer polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes 

25 herein to mean deleting one or more carbohydrate moieties found in native sequence breast 
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the 
native sequence breast cancer polypeptide. Glycosylation patterns can be altered in many 
ways. For example the use of different cell types to express breast cancer-associated 
sequences can result in different glycosylation patterns. 
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Addition of glycosylation sites to breast cancer polypeptides may also be 
accomplished by altering the amino acid sequence thereof The alteration may be made, e.g., 
by the addition of, or substitution by, one or more serine or threonine residues to the native 
sequence breast cancer polypeptide (for O-linked glycosylation sites). The breast cancer 

5 amino acid sequence may optionally be altered through changes at the DNA level, 

particularly by mutating the DNA encoding the breast cancer polypeptide at preselected bases 
such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the 
breast cancer polypeptide is by chemical or enzymatic coupling of glycosides to the 

10 polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in Aplin & 
Wriston, CRC Oil Rev. Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the breast cancer polypeptide 
may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 

1 5 deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et aiyArch. Biochem. Biophys., 259:52 (1987) and by Edge et al.Anal Biochem., 118:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al, Meth. 
EnzymoL, 138:350(1987). 

20 Another type of covalent modification of breast cancer comprises linking the 

breast cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Breast cancer polypeptides of the present invention may also be modified in a 

25 way to form chimeric molecules comprising a breast cancer polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 
molecule comprises a fusion of a breast cancer polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the breast cancer polypeptide. The 

30 presence of such epitope-tagged forms of a breast cancer polypeptide can be detected using 
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an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast 
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an 
5 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et 

10 al, Mol Cell. Biol 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et al., Molecular and Cellular Biology 5:3610-3616 (1985)); 
and the Herpes Simplex vims glycoprotein D (gD) tag and its antibody (Paborsky et al, 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
(Hopp et al, BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al, 

1 5 Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al. f J. Biol. Chem. 

266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al, 
Proc. Natl Acad. Scl USA 87:6393-6397 (1990)). 

Also included are other breast cancer proteins of the breast cancer family, and 
breast cancer proteins from other organisms, which are cloned and expressed as outlined 

20 below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 
used to find other related breast cancer proteins from humans or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in 
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 

25 about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for 
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra). 



Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to 
30 generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein 

52 



WO 02/059377 



PCT/US02/02242 



should share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller breast cancer protein will be able to bind to the full-length protein, 
5 particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 

1 0 Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 
subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but are not 

15 limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 
without undue experimentation. 

20 The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

antibodies may be prepared using hybridoma methods, such as those described by Kohler & 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 

25 the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 

immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
25 or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 

30 fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 

53 



WO 02/059377 



PCT/US02/02242 



glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice, 
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
5 medium that preferably contains one or more substances that inhibit the growth or survival of 
the unftised, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

10 In one embodiment, the antibodies are bispecific antibodies, Bispecific 

antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
protein encoded by a nucleic acid Tables 1-25 or a fragment thereof, the other one is for any 

15 other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 

preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to breast cancer protein are capable 
of reducing or eliminating a biological function of abreast cancer protein, as is described 

20 below. That is, the addition of anti-breast cancer protein antibodies (either polyclonal or 

preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce 
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or 
the like is preferred, with at least about 50% being particularly preferred and about a 95- 
100% decrease being especially preferred. 

25 In a preferred embodiment the antibodies to the breast cancer proteins are 

humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labsjnc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 

30 minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
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human immunoglobulins (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 

5 immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 

1 0 and all or substantially all of the framework (FR) regions are those of a human 

immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al, Nature 321:522-525 (1986); Riechmann et al, Nature 
332:323-329 (1988); and Presta, Curr. Op. Struct Biol 2:593-596 (1992)). Humanization 

15 can be essentially performed following the method of Winter and co-workers (Jones et al, 
Nature 321:522-525 (1986); Riechmann et al, Nature 332:323-327 (1988); Verhoeyen et al, 
Science 239: 1 534-1 536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 

20 human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries (Hoogenboom & Winter, J. Mol Biol 227:381 (1991); 
Marks et al, J. Mol Biol 222:581 (1991)). The techniques of Cole et al and Boerner et al 

25 are also available for the preparation of human monoclonal antibodies (Cole et al, 

Monoclonal Antibodies and Cancer Therapy, p- 77 (1985) and Boerner et al, J. Immunol 
147(l):86-95 (1991)). Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, 

30 human antibody production is observed, which closely resembles that seen in humans in all 
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respects, including gene rearrangement, assembly, and antibody repertoire. This approach is 
described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marks et al., Bio/Technology 10:779- 
783 (1992); Lonberg et al., Nature 368:856-859 (1994); Morrison, Nature 368:812-13 
5 (1994); Fishwild et al, Nature Biotechnology 14:845-51 (1996); Neuberger, Nature 

Biotechnology 14:826 (1996); Lonberg & Huszar, Intern. Rev. Immunol 13:65-93 (1995). 

By immunotherapy is meant treatment of breast cancer with an antibody raised 
against breast cancer proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 

10 (patient). Active immunization is the induction of antibody and/or T-cell responses in a 

recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 

15 expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the breast cancer proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 
antibodies used for treatment, bind and prevent the secreted protein from binding to its 

20 receptor, thereby inactivating the secreted breast cancer protein. 

In another preferred embodiment, the breast cancer protein to which 
antibodies are raised is a transmembrane protein. Without being bound by theory, antibodies 
used for treatment, bind the extracellular domain of the breast cancer protein and prevent it 
from binding to other proteins, such as circulating ligands or cell-associated molecules. The 

25 antibody may cause down-regulation of the transmembrane breast cancer protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
breast cancer protein. The antibody is also an antagonist of the breast cancer protein. 
Further, the antibody prevents activation of the transmembrane breast cancer protein. In one 

30 aspect, when the antibody prevents the binding of other molecules to the breast cancer 
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protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-p, IL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 
methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
5 activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, breast cancer is 
treated by administering to a patient antibodies directed against the transmembrane breast 
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 
otherwise provide means to locally ablate cells. 
10 In another preferred embodiment, the antibody is conjugated to an effector 

moiety. The effector moiety can be any number of molecules, including labelling moieties 
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the breast cancer 
protein. In another aspect the therapeutic moiety modulates the activity of molecules 
1 5 associated with or in close proximity to the breast cancer protein. The therapeutic moiety 
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity 
associated with breast cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to breast cancer tissue or cells, results in 
20 a reduction in the number of afflicted cells, thereby reducing symptoms associated with 

breast cancer. Cytotoxic agents are numerous and varied and include, but are not limited to, 
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 
25 radiochemicals made by conjugating radioisotopes to antibodies raised against breast cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 
the antibody. Targeting the therapeutic moiety to transmembrane breast cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the breast cancer 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
30 the therapeutic moiety. 
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In another preferred embodiment, the breast cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
5 the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a 
nuclear localization signal. 

The breast cancer antibodies of the invention specifically bind to breast cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
10 K<j of at least about 0.1 mM, more usually at least about 1 |iM, preferably at least about 0.1 
jiM or better, and most preferably, 0.01 \iM or better. Selectivity of binding is also 
important. 

Detection of breast cancer sequence for diagnostic and therapeutic applications 

1 5 In one aspect, the RNA expression levels of genes are determined for different 

cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue 
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 

20 essentially a "fingerprinf ' of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 

25 obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 

30 patterns within and among cells and tissue. Thus, a differentially expressed gene can 
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qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
5 detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
1 0 standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
1 5 about 50%, more preferably at least about 1 00%, more preferably at least about 1 50%, more 
preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
20 final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
evaluated in a breast cancer diagnostic test. 
25 In a preferred embodiment, gene expression monitoring is performed 

simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
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a particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of 
5 particular interest are methods wherein an niRNA encoding a breast cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 

10 examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 

1 5 washing to remove the non-specifically bound probe, the label is detected. For example a 
digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

20 In a preferred embodiment, various proteins from the three classes of proteins 

as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 

25 expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
30 Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis 
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of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 
detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the breast cancer protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
to many antibodies to the breast cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the breast cancer protein(s) 
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 
preferred embodiment each one of multiple primary antibodies contains a distinct and 
detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing breast 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 
Antibodies can be used to detect a breast cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 
immunoprecipitacion, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 

In a preferred embodiment, in situ hybridization of labeled breast cancer 
nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
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breast cancer tissue and/or normal tissue, are made. In situ hybridization {see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further understood that the genes which indicate the diagnosis may differ from 
5 those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing breast cancer sequences are used in prognosis 

10 assays. As above, gene expression profiles can be generated that correlate to breast cancer, 
in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
with the use of genes being preferred. As above, breast cancer probes may be attached to 
biochips for the detection and quantification of breast cancer sequences in a tissue or patient 
The assays proceed as outlined above for diagnosis. PCR method may provide more 

1 5 sensitive and accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drug screening assays. The breast cancer proteins, 

20 antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are 
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 

25 agent (e.g., Zlokarnik, et al y Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 

30 function of a breast cancer protein. As above, this can be done on an individual gene level or 
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by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 
5 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in breast cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene 
1 0 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4- fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10- fold decrease in breast 
15 cancer tissue compared to normal tissue often provides a target value of a 10- fold increase in 
expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard 
20 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.. 
25 In this embodiment, the breast cancer nucleic acid probes are attached to 

biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, 
may be used with dispensed primers in desired wells. A PCR reaction can then be performed 
and analyzed for each well. 
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Expression monitoring can be performed to identify compounds that modify 
the expression of one or more breast cancer- associated sequences, e.g., a polynucleotide 
sequence set out inTable 17. Generally, in a preferred embodiment, a test modulator is added 
to the cells prior to analysis. Moreover, screens are also provided to identify agents that 
5 modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or 
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 

10 indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses abreast cancer phenotype, e.g. to a normal tissue 
fingerprint. In another embodiment, a modulator induced a breast cancer phenotype. 

1 5 Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

Drug candidates encompass numerous chemical classes, though typically they 
20 are organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
25 preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate agents are 
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred 
30 are peptides. 
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In one aspect, a modulator will neutralize the effect of a breast cancer protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell 

In certain embodiments, combinatorial libraries of potential modulators will be 

5 screened for an ability to bind to a breast cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 

10 are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 

1 5 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 

20 chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al, J. Med. Chem. 37(9):1233-1251 (1994)). 

25 Preparation and screening of combinatorial chemical libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries {see, e.g., U.S. Patent No. 5,010,175, Furka, PepL Prot. Res. 37:487-493 
(1991), Houghton et al, Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

30 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
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hydantoins, benzodiazepines and dipeptides (Hobbs etal, Proc. Nat. Acad. Sci. USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al. y J. Amer. Chem. Soc. 
1 14:6568 (1992)), nonpeptidal peptidomimetics with aBeta-D-Glucose scaffolding 
(Hirschmann et al, J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses 
5 of small compound libraries (Chen et al, J. Amer. Chem. Soc. 1 16:2661 (1994)), 

oligocarbamates (Cho, et al, Science 261:1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al, 1 Org. Chem. 59:658 (1994)). See, generally, Gordon et al, J. Med. Chem. 
37:1385 (1994), nucleic acid libraries (see, e.g., Strategene, Corp.), peptide nucleic acid 
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn etal, Nature 

10 Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see, 
e.g., Liang et al, Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. P 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 

15 U.S.PatentNo. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 43 3 A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

20 • A number of well known robotic systems have also been developed for 

solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif), which mimic the manual 

25 synthetic operations performed by a chemist. Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
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MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription, 
5 inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
10 e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
15 (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, eta). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s ) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
20 as a high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
25 proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
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preferred. Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
5 to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 

10 may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 

1 5 preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 

20 nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SI I-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. 

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
25 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 
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After the candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

5.594.117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 

5.594.1 18, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
altering a step parameter that is a thermodynamic variable, including, but not limited to, 
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temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
5 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc, 
10 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
15 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the breast cancer phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
20 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
25 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress a breast 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
breast cancer gene expression profile so as to mimic the expression of the gene from normal 
30 tissue, a screen as described above can be performed to identify genes that are specifically 
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modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or 
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences 
can be identified and used by methods described herein for breast cancer genes or proteins. 

5 In particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated breast cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
breast cancer cells, that have an associated breast cancer expression profile. By 

10 "administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 

15 the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
20 generated, as outlined herein. 

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g., 
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably 
many, of the expression profile indicates that the agent has an effect on breast cancer activity. 
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter 
25 the phenotype can be devised. With this approach, the drug target need not be known and 
need not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
30 differentially expressed gene as important in a particular state, screening of modulators of 
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either the expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or 
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a 
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
5 acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a 
preferred embodiment, the breast cancer amino acid sequence which is used to determine 
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as 

10 further described herein. 

Preferably, the breast cancer modulatory protein is a fragment of 
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble 
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred 
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the 

15 C-terminus of the fragment is kept as a free acid and the N- terminus is a free amine to aid in 
coupling, i.e., to cysteine. 

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

20 Measurements of breast cancer polypeptide activity, or of breast cancer or the 

breast cancer phenotype can be performed using a variety of assays. For example, the effects 
of the test compounds upon the function of the breast cancer polypeptides can be measured 
by examining parameters described above. A suitable physiological change that affects 
activity can be used to assess the influence of a test compound on the polypeptides of this 

25 invention. When the functional consequences are determined using intact cells or animals, 
one can also measure a variety of effects such as, in the case of breast cancer associated with 
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional 
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
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messengers such as cGMP. In the assays of the invention, mammalian breast cancer 
polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator 
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELISA and the like with an antibody that selectively binds to the breast cancer polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the breast cancer 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 
treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" 
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a 
fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
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differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
5 expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the breast cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a breast 
cancer protein and a candidate compound, and determining the binding of the compound to 

10 the breast cancer protein. Preferred embodiments utilize the human breast cancer protein, 
although other mammalian proteins may also be used, e.g. for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
breast cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the breast cancer 

1 5 protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 
made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 

20 suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 

25 with the reagents and overall methods of the invention, maintains the activity of the 

composition and is nondiffusable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 

30 the protein or agent, excess unbound material is removed by washing. The sample receiving 
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areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
5 support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
1 0 protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer protein may be done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 
1 5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 
20 for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand, 

25 etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 
embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 

30 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
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screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the breast cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the 

10 label on the support indicates displacement. 

In an alternative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the breast cancer protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 

15 support, coupled with a lack of competitor binding, may indicate that the test compound is 
capable of binding to the breast cancer protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activity of the breast cancer proteins. In 
this embodiment, the methods comprise combining a breast cancer protein and a competitor 

20 in a first sample. A second sample comprises a test compound, a breast cancer protein, and a 
competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the breast cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the breast cancer protein. 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins. 
The structure of the breast cancer protein may be modeled, and used in rational drug design 
to synthesize agents that interact with that site. Drug candidates that affect the activity of a 
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breast cancer protein are also identified by screening drugs for the ability to either enhance or 

reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 

amount of bound compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a breast cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising breast cancer 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotberapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate breast cancer agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the breast cancer protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 
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In one embodiment, a method of inhibiting breast cancer cell division is 
provided. The method comprises administration of a breast cancer inhibitor. In another 
embodiment, a method of inhibiting breast cancer is provided. The method comprises 
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells 
5 or individuals with breast cancer are provided. The method comprises administration of a 
breast cancer inhibitor. 

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
10 those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 

1 5 such as semi-solid or soft agar. The transformed cells, when transfected with tumor 

suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 

20 the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 rd ed., 1994), 
herein incorporated by reference. See also, the methods section of Garkavtsev et al (1996), 

25 supra, herein incorporated by reference. 

Contact inhibition and dmsity limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
30 continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
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higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 

5 supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 II)-thyrnidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with abreast cancer-associated sequence and are grown for 24 hours at saturation 

1 0 density in non-limiting medium conditions. The percentage of cells labeling with ( 3 H> 
thymidine is determined autoradiographically. See, Freshney (1994), supra. 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
15 counterparts (see, e.g., Temin, J. Natl. Cancer fnsti. 37:167-175 (1966); Eagle et al., J. Exp. 
Med. 131:836-879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

20 Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth. 
25 in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol. (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (\99A),supra. Also, see, Unkless et al. , J. Biol. Chem. 249:4295-4305 (1974); 
30 Strickland & Beers, J.Biol. Chem. 251:5694-5702 (1976); Whur et al, Br. J. Cancer 42:305- 
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312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985); 
Freshney Anticancer Res. 5:111-130 (1985). 

5 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate breast cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
10 assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of 
1 5 the filter, is rated as invasiveness, and rated histologically by number of cells and distance 

moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

20 Effects of breast cancer-associated sequences on cell growth can be tested in 

transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out 
transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous breast cancer gene site in the mouse genome via homologous recombination. 

25 Such mice can also be made by substituting the endogenous breast cancer gene with a 

mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene, 
e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 

30 is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
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that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g., Capecchi et al t Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et al. 9 Manipulating the Mouse Embryo: A Laboratory Manual, 
Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella et ai 9 J. 
Natl. Cancer Inst 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley et al., Br, J. Cancer 38:263 (1978); Selby et aL, Br. J. Cancer 
41:52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10 6 cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
expressing a breast cancer-associated sequences are injected subcutaneously. After a suitable 
length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two 
largest dimensions) and compared to the control. Tumors that have statistically significant 
reduction (using, e.g., Student's T test) are said to have inhibited growth. 

Polynucleotide modulators of breast cancer 

Antisense Polynucleotides 

In certain embodiments, the activity of a breast cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof. 
Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally- occurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
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sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the breast cancer protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
5 means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 

10 binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense 
molecule is for a breast cancer sequences in Tables 1-25, or for a ligand or activator thereof. 
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 

1 5 generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988 and 
van der Krol et al. (BioTechniques 6:958 (1988)). 

20 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an 
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 

25 ribozymes, RNase P, and axhead ribozymes {see, e.g., Castanotto et al.,Adv. in 

Pharmacology 25: 289-317 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al, 
Nucl Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
30 No. 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g., 
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WO 94/26877; Ojwang et aL, Proc. Natl Acad. ScL USA 90:6340-6344 (1993); Yamada et 
aL, Human Gene Tlierapy 1 :39-45 (1994); Leavitt et aL, Proc. Nail Acad. Set USA 92:699- 
703 (1995); Leavitt et aL, Human Gene Therapy 5:1151-120 (1994); and Yamada et aL, 
Virology 205: 121-126 (1994)). 
5 Polynucleotide modulators of breast cancer may be introduced into a cell 

containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
1 0 not substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
1 5 understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating breast cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous 
20 breast cancer protein. Alternatively, the methods comprise administering to a cell or 
organism a recombinant nucleic acid encoding a breast cancer protein. This may be 
accomplished in any number of ways. In a preferred embodiment, e.g. when the breast 
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing 
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by 
25 overexpressing the endogenous breast cancer gene or administering a gene encoding the 
breast cancer sequence, using known gene-therapy techniques, e.g.. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene 
using enhanced homologous recombination (EHR), e.g. as described in PCT/US93/03868, 
hereby incorporated by reference in its entirety. Alternatively, e.g. when the breast cancer 
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sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is 
decreased, e.g. by the administration of a breast cancer an ti sense nucleic acid. 

In one embodiment, the breast cancer proteins of the present invention may be 
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly, 
5 the breast cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify breast cancer 
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer 
10 antibodies may be coupled to standard affinity chromatography columns and used to purify 
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the breast cancer protein. 

Methods of identifying variant breast cancer-associated sequences 

15 Without being bound by theory, expression of various breast cancer sequences 

is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast 
cancer genes may be determined. In one embodiment, the invention provides methods for 
identifying cells containing variant breast cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous breast cancer genes in a cell. This may be accomplished 

20 using any number of sequencing techniques. In a preferred embodiment, the invention 

provides methods of identifying the breast cancer genotype of an individual, e.g., determining 
all or part of the sequence of at least one breast cancer gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a 
number of tissues or different samples of the same tissue. The method may include 

25 comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene, 
i.e., a wild-type gene. 

The sequence of all or part of the breast cancer gene can then be compared to 
the sequence of a known breast cancer gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 

30 embodiment, the presence of a difference in the sequence between the breast cancer gene of 
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the patient and the known breast cancer gene correlates with a disease state or a propensity 
for a disease state, as outlined herein. 

In a preferred embodiment, the breast cancer genes are used as probes to 
determine the number of copies of the breast cancer gene in the genome. 

In another preferred embodiment, the breast cancer genes are used as probes to 
determine the chromosomal localization of the breast cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the breast 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel et aL, Pharmaceutical Dosage Forms and Drug Delivery; 
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd, TlxeArt, Science and Technology of 
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is 
known in the art, adjustments for breast cancer degradation, systemic versus localized 
delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction and the severity of the condition may be 
necessary, and will be ascertainable with routine experimentation by those skilled in the art. 
U.S. Patent Application N. 09/687,576, further discloses the use of compositions and 
methods of diagnosis and treatment in breast cancer is hereby expressly incorporated by 
reference. 

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 
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The administration of the breast cancer proteins and modulators thereof of the 
present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneal^, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
5 some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins 
and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a breast 
cancer protein in a form suitable for administration to a patient. In the preferred embodiment, 
the pharmaceutical compositions are in a water soluble form, such as being present as 

1 0 pharmaceutical^ acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 

15 propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, flimaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 

20 manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 

25 tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 
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The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a breast cancer 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 
the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington s Pharmaceutical Science (15th ed., 1980) and Goodman & Gillman, 
TJie Pharmacologial Basis of Therapeutics (Hardman et a/.,eds., 1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to 1 0 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrate compositions will be known or apparent to those skilled 
in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gillman, The 
pharmacologial Basis of Therapeutics, supra. 

87 



WO 02/(159377 



PCT/US02/02242 



The compositions containing modulators of breast cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
5 amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 

10 effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 

15 treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 

recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 

It will be appreciated that the present breast cancer protein-modulating 
compounds can be administered alone or in combination with additional breast cancer 

20 modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
25 methods, reagents, vectors, and cells useful for expression of breast cancer-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
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of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
5 volume 152 (Berger), Ausubel et al, eds., Current Protocols (supplemented through 1999), 
and Sambrook et al, Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 

In a preferred embodiment, breast cancer proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory 
10 sequences of the breast cancer coding regions) can be administered in a gene therapy 

application. These breast cancer genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

Breast cancer polypeptides and polynucleotides can also be administered as 
1 5 vaccine compositions to stimulate HTL, CTL and antibody responses. . Such vaccine 

compositions can include, e.g., lipidated peptides (see r e.g.,Vitiello, A. et al,J. Clin. Invest. 
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") 
microspheres (see, e.g., Eldridge, et al, Molec. Immunol. 28:287-294, (1 991); Alonso et al, 
Vaccine 12:299-306 (1994); Jones etaL, Vaccine 13:675-681 (1995)), peptide compositions 
20 contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al, Nature 
344:873-875 (1990); Hu et al, Clin Exp Immunol. 1 13:235-243 (1998)), multiple antigen 
peptide systems (MAPs) (see, e.g., Tarn, Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413 (1988); 
Tarn, J. Immunol. Methods 196: 17-32 (1996)), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 
25 vectors (Perkus, et al, In: Concepts in vaccine development (Kauftnann, ed. } p. 379, 1996); 
Chakrabarti, et al, Nature 320:535 (1986); Hu et al, Nature 320:537 (1986); Kieny, et al, 
AIDS Bio/Technology 4:790 (1986); Top etal,J. Infect. Dis. 124:148 (1971); Chandae/a/., 
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al,l 
Immunol Methods. 192:25 (1996); Eldridge etal, Sem. Hematol. 30:16 (1993); Falo etal, 
30 Nature Med. 7:649 (1995)), adjuvants (Warren et al, Annu. Rev. Immunol 4:369 (1986); 
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Gupta et al, Vaccine 1 1:293 (1993)), liposomes (Rcddy et aL, J. Immunol. 148:1585 (1992); 
Rock, Immunol. Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, etal., 
Science 259:1745 (1993); Robinson et al, Vaccine 11:957 (1993); Shiver etaL, In: Concepts 
in vaccine development (Kaufmann, ed., p. 423, 1996); Cease & Berzofsky, Anna. Rev. 
5 Immunol. 12:923 (1994) and Eldridge et ai 9 Sent. Hematol 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 

10 or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 

15 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

20 Vaccines can be administered as nucleic acid compositions wherein DNA or 

RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et. al, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,1 18; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 

25 include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

30 include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
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vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al., Nature 35 1 :456-460 (1991). A wide variety of other vectors useful 
for therapeutic administration or immunization e.g. adeno and adeno-associated virus vectors, 
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, 
will be apparent to those skilled in the art from the description herein (see, e.g., Shata et al, 
MolMed Today 6:66-71 (2000); Shedlock et al.J Leukoc Biol 68:793-806 (2000); Hipp et 
a/., //i K/vo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a breast cancer gene or portion of a breast cancer gene under the control of a 
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient. 
The breast cancer gene used for DNA vaccines can encode foil-length breast cancer proteins, 
but more preferably encodes portions of the breast cancer proteins including peptides derived 
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For 
example, breast cancer- associated genes or sequence encoding subfragments of a breast 
cancer protein are introduced into expression vectors and tested for their immunogenicity in 
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This 
procedure provides for production of cytotoxic T cell responses against cells which present 
antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment breast cancer genes find use in generating 
animal models of breast cancer. When the breast cancer gene identified is repressed or 
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
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to the breast cancer gene will also diminish or repress expression of the gene. Animal models 
of breast cancer find use in screening for modulators of a breast cancer-associated sequence 
or modulators of breast cancer. Similarly, transgenic animal technology including gene 
knockout technology, e.g. as a result of homologous recombination with an appropriate gene 
5 targeting vector, will result in the absence or increased expression of the breast cancer 
protein. When desired, tissue- specific expression or knockout of the breast cancer protein 
may be necessary. 

It is also possible that the breast cancer protein is overexpressed in breast 
cancer. As such, transgenic animals can be generated that overexpress the breast cancer 
10 protein. Depending on the desired expression level, promoters of various strengths can be 
employed to express the transgene. Also, the number of copies of the integrated transgene 
can be determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of breast cancer and are 
additionally useful in screening for modulators to treat breast cancer. 

15 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic 

20 acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 

ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecules 
inhibitors of breast cancer-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 

25 (i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
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like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
breast cancer-associated sequences. Such kits can be prepared from readily available 

5 materials and reagents. For example, such kits can comprise one or more of the following 
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and 
instructions for testing breast cancer-associated activity. Optionally, the kit contains 
biologically active breast cancer protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 

10 the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 
parameters in disease which may be identified in historical or outcome data. 



15 EXAMPLES 

Example 1 : Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample us ing TRIzol Reagent 

20 The sample weight is first estimated. The tissue samples are homogenized in 

1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of 
the generator/probe used depends upon the sample amount. A generator that is too large for 
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A 
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g. Fill 

25 tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than 
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization. 

Tissues should be kept frozen until homogenized. The TRIzol is added 
directly to the frozen tissue before homogenization. Following homogenization, the insoluble 
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a 

30 Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 40Q The cleared 
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homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to 

-70°C for at least one month or else continue with the purification. 

The next process is phase separation. The homogenized samples are incubated 
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1ml of TRIzol reagent is 
5 added to the homogenization mixture. The tubes are securely capped and shaken vigorously 
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp, for 
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4oC. 

The next process is RNA Precipitation. The aqueous phase is transferred to a 
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then 
10 0.5 ml of isopropyl alcohol is added per 1ml of TRIzol reagent used in the original 

homogenization. Then, the tubes are securely capped and inverted to mix. The samples are 
then incubated at room temp, for 10 minutes an centrifuged at 6500 rpm in Sorvall for 20 

min. at 4°C. 

The RNA is then washed. The supernatant is poured off and the pellet washed 
15 with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in 
the initial homogenization. The tubes are capped securely and inverted several times to 
loosen pellet without vortexing . They are next centrifuged at <8000 rpm (<7500 x g) for 5 
minutes at 4°C 

The RNA wash is decanted. The pellet is carefully transferred to an 
20 Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it 
in if necessary). Tube(s ) sizes for precipitating the RNA depending on the working volumes. 
Larger tubes may take too long to dry. Dry pellet. The RNA is then resuspended in an 
appropriate volume (e.g., 2 -5 ug/ul) of DEPC H 2 0. The absorbance is then measured. 

The poly A+ mRNA may next be purified from total RNA by other methods 
25 such as Qiagen' s RNeasy kit. The poly A + mRNA is purified from total RNA by adding the 
oligotex suspension which has been heated to 37°C and mixing prior to adding to RNA. 
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x 
Binding Buffer at 65°C The the total RNA is mixed with DEPC-treated water, 2 x Binding 
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Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next 
incubated for 3 minutes at 65°C and 10 minutes at room temperature. 

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably, 
at a "soft setting," The supernatant is removed without disturbing Oligotex pellet. A little bit 
5 of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until 
satisfactory binding and elution of poly A+ mRNA has been found. 

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted 
onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute. 

Next, the spin column is transferred to a new collection tube and gently 
10 resuspended in Wash Buffer OW2 and centrifuged as described herein. 

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul 
of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up 
and down. The centrifugation is repeated as above and the elution repeated with fresh elution 
buffer or first eluate to keep the elution volume low. 
! 5 The absorbance is next read to determine the yield, using diluted Elution 

Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA is precipitated before 
proceeding with cDNA synthesis, as components leftover or in the Elution Buffer from the 
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA. 

20 0.4 vol. of 7.5 M NH40Ac + 2.5 vol. of cold 100% ethanol is added and the preparation 
precipitated at -20°C 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at 
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80% 
ethanol (-20°C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature. 
The80% ethanol wash is then repeated. The last bit of ethanol from the pellet is then dried 

25 without use of a speed vacuum and the pellet is then resuspended in DEPC H 2 0 at 1 ug/ul 
concentration. 

Alternatively the RNA may he purified u sing other methods (e.g., Oiaqen's RNeasy kit). 
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No more than 100 ug is added to the RNeasy column. The sample volume is 
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol 
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an 
RNeasy mini spin column for centrifugation (15 sec at >1 0,000 rpm). If yield is low, reapply 
5 the flowthrough to the column and centrifuge again. 

Then, transfer column to a new 2 ml collection tube and add 500 ul Buffer 
RPE and centrifuge for 15 sec at >10,000rpm. The flowthrough is discarded. 500 ul Buffer 
RPE and is then added and the preparation is centriuged for 15 sec at > 10,000 rpm. The 
flowthrough is discarded, and the column membrane dried by centrifuging for 2 min at 
10 maximum speed. The column is transferred to a new 1.5 -ml collection tube. 30-50 ul of 

RNase-free water is applied directly onto column membrane. The column is then centrifuged 
for 1 min at > 10,000 rpm and the elution step repeated. 

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol. 

15 

First Strand cDNA Synthesis 

The first strand can be make using using Gibco's "Superscript Choice System 
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA+ 
mRNAL For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of 

20 Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA 
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol 
T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4ul 5X 1 st Strand 
Buffer, 2 ul of 0.1M DTT, and 1 ul of lOmM dNTP mix. The preparation is then incubated at 
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1 

25 hour. 

Second Strand Synthesis 

For the second strand synthesis, place 1st strand reactions on ice and add: 91 

ul DEPC H 2 0; 30 ul 5X 2nd Strand Buffer; 3 ul lOmM dNTP mix; 1 ul 10 U/ulE.coli DNA 

30 Ligase; 4 ul 10 U/ul Exoli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2 
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hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 rain at 16°C. Add 10 ul of 0.5M 
EDTA. 

Cleaning up cDNA 

5 The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24:1) 

and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed. 
The cDNA mix is then transferred to PLG tube. An equal volume of 
phenol :chloroform:isamyl alcohol is then added, the preparation shaken vigorously (no 
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is 

10 transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH40Ac and 2.5X 
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20 
min, maximum speed. The supernatant is removed, and the pellet washed with 2X with cold 
80% ethanol. As much ethanol wash as possible should be removed before air drying the 
pellet; and resuspending it in 3 ul RNase-free water. 

15 

Tn vitro Transcription <TVT) and labeling w ith biotin 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1 .5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul 
T7 lOxATP (75 mM) (Ambion); 2 ul T7 lOxGTP (75 mM) (Ambion); 1.5 ul T7 lOxCTP (75 

20 mM) (Ambion); 1.5 ul T7 lOxUTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-ll-UTP 

(Boehringer-MannheimTRoche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul lOx T7 
transcription buffer (Ambion); and 2 ul lOx T7 enzyme mix (Ambion). The final volume is 
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol 

25 handbook. The cRNA often needs to be ethanol precipitated by resuspension in a volume 
compatible with the fragmentation step. 

Fragmentation is performed as follows. 15 ug of labeled RNA is usually 
fragmented. Try to minimize the fragmentation reaction volume; a 1 0 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 

30 the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
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RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
5 of the transcript size range 

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50 ng/ul final cone); 50 pM 948-b control 
10 oligo; 1.5 pM BioB; 5 pM BioC; 25 pMBioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA; 
0.5 mg/ml acetylated BSA; and 300 ul with lxMES hyb buffer. 

The hybridization reaction is conducted with non-biotinylated IVT (purified 
by RNeasy columns) (see example 1 for steps from tissue to IVT): The following mixture is 
prepared: 

1 5 IVT antisense RNA; 4 ug: ul 

Random Hexamers (1 jj.g/jj.1): 4 ul 
H 2 0: ul 

14 ul 

Incubate the above 14 ul mixture at 70°C for 10 min.; then put on ice. 
20 The Reverse transcription procedure uses the following mixture: 

0.1MDTT: 3 ul 

50X dNTP mix: 0.6 ul 

H 2 0: 2.4 ul 

Cy3 or Cy5 dUTP (ImM): 3 ul 
25 SS RT II (BRL): 1 ul 



16 ul 

The above solution is added to the hybridization reaction and incubated for 30 min., 42°C. 

Then, 1 ul SSII is added and incubated for another hour before being placed on ice. 
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The 50X dNTP mix contains 25mM of cold dATP, dCTP, and dGTP, lOmM 
of dTTP and is made by adding 25 p.1 each of lOOmM dATP, dCTP, and dGTP; 10 pi of 
lOOmMdTTP to 15 plH 2 0.] 

RNA degradation is performed as follows. Add 86 pi H 2 0, 1 .5 |il 1M NaOH/ 
5 2 mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 pi TE/sample spin at 7000 g 
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500 pi buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min. 
Incubate at 5 min 95°C to denature the DNAse. 

10 

Sam ple preparation 

For sample preparation, add Cot-1 DNA, 10 pi; 50X dNTPs, 1 pi; 20X SSC, 2.3 pi; Na pyro 
phosphate, 7.5 pi; 10 mg/ml Herring sperm DNA; 1 ul of 1/10 dilution to 21.8 final vol. Dry 
in speed vac. Resuspend in 15 pi H 2 0. Add 0.38 pi 10% SDS. Heat 95°C, 2 min and slow 
1 5 cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. Washing after 
the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X SSC+0.75ml 10% SDS in 
250ml H 2 0; IX SSC: 5 min., 12.5 ml 20X SSC in 250ml H 2 0; 0.2X SSC: 5 min., 2.5 ml 20X 
SSC in 250ml H 2 0. Dry slides and scan at appropiate PMT's and channels. 
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TABLE 1: Figure 1 from BRCA 001 US 

5 Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



10 



15 



Pkey; Unique Eos probeset identifier number 

ExAccn: Exemplar Access ion number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Trtle: Unigene gene title 

R1 : Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigenelD UnigeneTitle R1 

20 100472 D90084 Hs.1023 pyruvate dehydrogenase (lipoamkje) alpha 5 

100499 T51986 Hs 283108 hemoglobin, gamma G 10 

100545 M55405 gb Homo sapiens mucin (MUC-3) mRNA, part 5 

100549 BE142019 Hs 222056 Homo sapiens cDNA FLJ1 1572 fis, clone HE 10 

100613 X52078 Hs 101047 transcription factor 3 (E2A immunoglobut 5 

25 100635 BE259039 Hs 129953 Ewing sarcoma breakpoint region 1 5 

100645 X16841 Hs.167988 neural ceil adhesion molecule 1 5 

100654 A03758 NM_000477*:Homo sapiens albumin (ALB), rn 10 

100702 L27065 gb.Human neurofibromatosis 2 (NF2) mRNA, 5 

100915 M60832 Hs 249239 collagen, type VIII, alpha 2 5 

30 100971 BE379727 Hs 83213 fatty acid binding protein 4, adipocyte 10 

101125 AJ250562 Hs 82749 transmembrane 4 superfamtly member 2 5 

101166 M90424 Hs2099 lipocalin 1 (protein migrating faster th 5 

101184 NM_001674 Hs460 activating transcription factor 3 10 

101336 NfvL006732 Hs75678 FB J murine osteosarcoma viral oncogene h 10 

35 101367 X03350 Hs4 alcohol dehydrogenase 1B (class I), beta 10 

101447 M21305 gb Human alpha satellite and satellite 3 10 

101461 N98569 Hs 76422 phospholipase A2, group IIA (platelets, 10 

101511 M27826 Hs267319 endogenous retroviral protease 10 

101634 AV650262 Hs 75765 GR 02 oncogene 5 

40 101736 M74447 Hs502 transporter 2, AT P-binding cassette, sub 10 

102208 U22961 gb Human mRNA clone with similarity to L 10 

102297 NMJW1504 Hs.198252 G protein-coupled receptor 9 5 

102450 U48251 Hs 75871 protein kinase C binding protein 1 10 

102515 U89337 Hs 169886 tenascin XB 10 

45 102571 U60115 Hs 239069 four and a half LIM domains 1 5 

102800 AA313538 gb EST1 85419 Colon carcinoma (HCC) cell 10 

102857 IMM_006744 Hs 76461 retinol-bi riding protein 4, interstitial 10 

102990 AA829286 Hs 332053 serum amyloid A1 10 

103434 X98085 Hs 54433 tenascin R (restrictin, janusin) 5 

50 103747 AA081 995 gb:zn26d06.r1 Stratagene neuroepithelium 10 

103750 AA 126 129 gb:zm7Bc07.r1 Stratagene neuroepithelium 5 

103812 AA137107 Hs 326391 Homo sapiens, done MGC: 16638, mRNA, com 10 

103851 AA326216 Hs 8719 hypothetical protein MGC 1 1 36 5 

104080 ABO41036 Hs 57771 kallikrein 11 (KLK11; TLSP; PRSS20; hipp 5 

55 104093 R50727 Hs 336970 ESTs 10 

104106 AA422 123 gb zv26h12.r1 Soares_NhHMPu_S1 Homo sap; 5 

104109 AL353957 Hs 284181 hypothetical protein DKFZp434P0531 10 

104250 F06638 Hs 12440 Homo sapiens clone 24734 mRNA sequence 10 

104340 AA426 189 gb:zw1 1e09.r1 Soares_NhHMPu_S1 Homosapi 5 

60 104492 N73185 Hs94285 EST 10 

104506 N91071 Hs 109650 ESTs 10 

104511 N99542 Hs572 oroso mucoid 1 5 

104532 AI498763 Hs 203013 hypothetical protein FLJ 12748 10 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



104536 
104572 
104659 
104 £77 
104711 
104731 
104764 
105005 
105036 
105105 
105231 
105239 
105921 
105957 
106052 
106119 
106181 
106194 
106283 
106379 
106451 
106491 
106700 
106782 
106851 
106870 
106892 
106954 
106991 
107103 
107124 
107148 
107214 
107242 
107331 
107351 
107423 
107447 
107451 
107453 
107459 
107683 
107711 
107754 
107757 
107864 
107872 
107888 
107997 
108056 
108081 
108113 
108238 
108257 
108335 
108351 
106382 
108392 
108441 
108446 
106497 
108604 
108662 
108706 
108738 
108827 
109123 



R24024 
Y11312 
AVV969769 
AA00S764 
AA017245 
AA019300 
AI039243 
AI298208 
AA130390 
R61532 
AW970043 
AA221036 
AA421973 
BE242857 
N79885 
AL359624 
AI803651 
AW976171 
A1085846 
AL042069 
AW235928 
AA135688 
AA906434 
AW054886 
AI458623 
AI983730 
AI347578 
AF128847 
AJ223811 
AI446183 
AB006532 
AI005036 
AF 127026 
AB020672 
AI905985 
U51704 
W26652 
W28516 
AL042425 
AS092790 
W38002 
N53167 
W96141 
AA017462 
BE621721 
AA025060 
BE271708 
AA025836 
AL049176 
AA043675 
AA093668 
AA012881 
AA059473 
AA677927 
AA070500 
AA071193 
NM_006770 
AA075124 
AA079O79 
AA085383 
AA074897 
AA934589 
AF1 17646 
AA121820 
AA1 26583 
AI273692 
AI028376 



Hs. 158101 Homo sapiens cDNA FU 14673 fis, done NT 
Hs.132463 phosphoinositide-3-lOnase, class 2, beta 
Hs.105201 ESTs 
Hs.190380 ESTs 
Hs.32794 ESTs 

Hs.125070 ESTs, Moderately simitar to I54374 gene 
Hs.278585 ESTs 
Hs.28805 ESTs 

Hs.25549 hypothetical protein FU20898 
Hs.87016 hypothetical protein FLJ22938 
Hs.238039 hypothetical protein FU 11 090 

gb:zr03f12.r1 Stratagene IMT2 neuronal pr 
ESTs, Weakly similar toT25731 hypotheti 
hypothetical protein FIJ1 1159 
ESTs, Highly similar to T00391 hypotheti 
KIAA1453 protein 
ESTs 

hypothetical protein FU22233 
K1AA1808 protein 
DKFZP434N061 protein 
ESTs 

Homo sapiens, clone IMAGE:4139786 ( mRNA, 
zinc finger protein 216 
Homo sapiens mRNA for KIAA1863 protein, 
gb:tk04g09.x1 NCI_CGAP_Lu24 Homo sapiens 
serum deprivation response (phosphatidyl 
hypothetical protein MGC2605 
Indolethylamine N-methyttransf erase 
hypothetical protein 

ESTs, Highly similar to CYA5_HUMAN A DENY 
RecQ protein-like 4 
GS1999full 
myosin I A 
K1AA0865 protein 
ESTs 

ESTs, Moderately similar to ALU8JHUMAN A 
PTEN induced putative kinase 1 
hypothetical protein MGC1 1 308 
hypthetical protein PR02389 
hypothetical protein FU 14529 
Empirically selected from AFFX single pr 
ESTs 
ESTs 
ESTs 

hypothetical protein FU12387 similarto 
ESTs 

ESTs, Weakly similar to A55943 1-phospha 
ESTs 

chordin-like 
ESTs 

musdeblind (DrosophibHike 
hypothetical protein FU 1 1 838 
EST 
ESTs 

gb2m70h03.s1 Stratagene neuroepitheiium 
gb:zf79b12.s1 Scares jjineal_gland_N3HPG 
macrophage receptor with collagenous str 
gb2m86a01.s1 Stratagene ovarian cancer 
gbzm97c09.s1 Stratagene colon HT29 (937 
gb:zn13g03.s1 Stratagene hNT neuron (937 
gb:zm85a05.r1 Stratagene ovarian cancer 
Hs.49696 ESTs 

Hs.1 56637 Cas-Br-M (murine) ectropic retroviral tr 
Hs.74569 KIAA0842 protein 
Hs.1 58725 ESTs 
Hs.1 10470 ESTs 
Hs.73232 ESTs 



Hs.169119 

Hs.27021 

Hs.6382 

Hs.1 1387 

Hs.1 91608 

Hs.286194 

Hs.25522 

Hs.1 19021 

Hs.313182 

Hs.10083 

Hs.3776 

Hs.25682 

Hs.26530 

Hs.1 24015 

Hs,204038 

Hs.30127 

Hs.9572 

Hs.31442 

Hs.334305 

Hs.5394 

Hs.175411 

Hs.1 11805 

Hs.323428 

Hs.6163 

Hs.19210 

Hs .283976 

Hs.334703 

Hs.47623 

Hs.220687 

Hs.269244 

Hs.280792 

Hs.61246 

Hs.95110 

Hs.191637 

Hs.82223 

Hs.62633 

Hs.28578 

Hs.72531 

Hs.66783 

Hs.144269 



Hs.67726 
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109389 AA101325 Hs.86154 hypothetical protein FLJ 12457 10 

109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 5 

109919 R40604 Hs.129539 ESTs, Weakly similar to MCAT^HUMAN MITOC 10 

110006 AI094674 Hs,30524 ring finger protein 24 10 

5 110141 H46749 Hs.31540 ESTs 10 

110354 W22165 Hs, 22586 ESTs 5 

110433 AW294162 Hs.301062 UDP-N-acetyl-alpha-I>galactos amine :polyp 10 

110448 H51276 Hs.13526 hypothetical protein FLJ 1268 8 . 10 

110455 H52576 gb:yt85e08.r1 Soars sj5*meal_g)and_N3HPG 5 

10 110540 H72639 Hs 167608 ESTs 5 

110553 H60593 Hs.124990 ESTs 10 

110976 AL044174 Hs.1 59526 patched (Drosophila) homoJog 10 

110987 AI753316 Hs 26034 ESTs 5 

111158 N66616 Hs 138629 H.sapiens rnRNA for subtelomeric repeat s 5 

15 111168 AI798376 gb tr34b07.x1 NCI_CGAP„Ov23 Homo sapiens 10 

111187 AJ224864 Hs9688 leukocyte membrane antigen 5 

111307 AA641636 Hs 37477 ESTs, Weakly similar to T4 6908 hypotheti 5 

111400 R00144 Hs 189771 ESTs 10 

111498 AI1 68511 gb.ow90h09.s1 Soares Jetaljiver_spleen_ 10 

20 111651 R16733 Hs20499 ESTs 10 

111738 R26065 gbyh39d03.s1 Soares placenta Nb2 HP Homo 5 

111803 AA593731 Hs 325823 ESTs, Moderately similar to ALUS JWMAN A 10 

111995 R42333 Hs 302292 ESTs 10 

112071 AL1 17490 Hs 47225 Ras-associated protein Rap1 10 

25 112204 NM_006668 Hs 25121 cytochrome P450, sub family 46 (cholester 10 

112258 R51889 Hs 24990 ESTs 5 

112490 R31094 Hs 24378 ESTs 10 

112588 R77302 gb yi75h08.s1 Soares placenta Nb2HP Homo 10 

112654 BE618629 Hs 268809 ESTs 5 

30 112784 T98628 Hs 191290 ESTs 5 

112817 AI057205 Hs 14584 ESTs 5 

112885 AA581428 Hs 5021 EST 10 

112913 T16837 Hs4241 ESTs 5 

113149 T51 588 gbyb27e06.s1 Stratagene fetal spleen (9 10 

35 113174 T54659 Hs 301755 Homo sapiens cDNA FLJ 1 1465 fis, clone HE 5 

113203 AA743563 Hs 10305 ESTs 5 

113299 AW207424 Hs 332594 ESTs 10 

113367 N92359 Hs 14518 ESTs, Moderately similar to A48752 B-cel 10 

113457 R16763 Hs 268679 ESTs 5 

40 113563 AA913635 Hs 326413 Homo sapiens cDNA FLJ2081 2 fis, clone AD 10 

113574 R06874 Hs 268628 ESTs, Moderately similar to ALU1_HUMAN A 5 

113776 A 1791 905 Hs. 95549 hypothetical protein 10 

113790 AI244311 Hs 26912 ESTs 10 

113807 W07586 Hs8045 ESTs 3 

45 113958 W86195 gb:zh54e05,s1 SoaresJetal_liver_spleen_ 10 

114211 Z39319 Hs 27347 EST 10 

114254 AB0 18263 Hs 180338 tumor necrosis factor receptor superf ami 5 

114349 AA745978 Hs 28273 ESTs 5 

1 14449 AA0 20736 gb ze63b1 1 .s1 Soares retina N2b4HR Homo 5 

50 114484 AA034378 Hs 267319 endogenous retroviral protease 5 

114576 AA065096 gb zm50a02.s1 Stratagene fibroblast (937 5 

114624 AA08 1507 gb:zn05b10.r1 Stratagene hNT neuron (937 5 

114844 AA234826 Hs 87386 EST 5 

114906 AA234462 Hs 87350 ESTs 5 

55 115624 AK000725 Hs 50579 hypothetical protein FLJ207 18 3 

115666 AF173081 Hs 178215 Vertebrate LIN7 homoJog 1, Tax interact! 5 

115712 AB020649 Hs 74569 KJAA0842 protein 5 

115889 AA398841 Hs 39650 hypothetical protein FLJ205 17 10 

115949 AI478427 Hs 43125 esophageal cancer related gene 4 protein 10 

60 116107 AL133916 Hs 172572 hypothetical protein FLJ20093 10 

116180 AA463902 Hs 13522 ESTs, Weakly simflar to I38022 hypotheti 5 

116267 AW968703 Hs 30085 hypothetical protein FLJ23186 5 

116291 AW410377 Hs 41502 hypothetical protein FLJ 2 1276 5 

116527 AW194253 Hs.68607 ESTs 10 

65 116659 BE314852 Hs.168694 Homo sapiens clone 23763 unknown mRNA, p 5 

116708 F10528 Hs70001 ESTs, Moderately similar to JC 6169 nucte 5 

117058 AW801806 gb: IL5 -U M0 07 0-11 0400-062 -gO 7 UM0070 Homo 5 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



117151 
117226 
117323 
117571 
117624 
117673 
117847 
117877 
117919 
118049 
118413 
118613 
118664 
118858 
118902 
119039 
119159 
119216 
119317 
119366 
119378 
119528 
119792 
119800 
119817 
119835 
119923 
119961 
120379 
120931 
121037 
121282 
121382 
121764 
122034 
122441 
122756 
122771 
123601 
123623 
123941 
124215 
124276 
124680 
125099 
125121 
125188 
125284 
125906 
128484 
128511 
128538 
128606 
128850 
128870 
128903 
128931 
129001 
129091 
129101 
129146 
129213 
129228 
129265 
129285 
129346 
129368 



AI803656 Hs.42373 ESTs 

N20468 gb:yx39b10.s1 Soares melanocyte 2NbHM Ho 

A1472863 Hs.43387 ESTs 

N34417 Hs.44584 ESTs 

N26627 Hs.82364 ESTs, Weakly simitar to JC4124 pregnancy 

N40551 Hs.184043 Homo sapiens Els-1 binding protein (E1B) 

N49285 Hs.182391 ESTs 

AW263476 Hs.44268 myelin gene expression factor 2 

BE222341 Hs.279472 ESTs 

N53145 gb:yv55f09.s1 Soares fetal liver spleen 

AW955696 Hs.90960 ESTs 

AI078236 Hs.49688 ESTs 

N70907 Hs.230619 EST 

AL1 22040 Hs. 102981 Homo sapiens mRNA; cDNA DKFZp434Gl972 (f 

AA993527 Hs.293907 hypothetical protein FU23403 

AI160570 Hs.252097 pregnancy specific beta-1 -glycoprotein 6 

AF142419 Hs.15020 homolog of mouse quaking QKI (KH domain 

AA514422 Hs.221849 ESTs 

AK002001 Hs.51305 v-maf musculoaponeurotic fibrosarcoma (a 

J77892 gb:yd20f04.s1 Soares fetal liver spleen 

T81824 Hs.90949 EST 

W38051 Empirically selected from AFFX single pr 

AL049798 Hs.80552 dermatopontin 

AF086332 Hs:58314 ESTs 

AF088Q61 Hs.1 59690 ESTs 

AF086429 Hs.58429 ESTs 

AW8O3308 Hs.62954 ferritin, heavy polypeptide 1 

U34249 Hs.337461 Human putative zinc finger protein (2NFB 

AL042725 g b:DKFZp434B1822j1 434 (synonym: htes3) 

AW1 36934 Hs 97162 ESTs 

AA907743 Hs 142373 ESTs 

AA401695 Hs.97334 ESTs 

AA405763 Hs.1 11939 Homo sapiens cDNA FLJ20470 Hs, done KA 

AA421452 Hs.164851 ESTs, Weakly similar to K1AA0926 protein 

AK000229 Hs.98017 Homo sapiens cDNA FLJ20222 fis, clone CO 

AA447555 Hs.99116 EST 

AA458945 Hs.95898 ESTs 

AW 135093 Hs.97282 ESTs, Highly similar to G100_HUMAN 1 10 K 

AA609122 Hs.1 12645 Homo sapiens mRNA; cDN A DKFZp434D2472 (f 

AI024595 Hs.97508 a disintegrin and metailoproteinase doma 

AA621529 gb:af47a02.s1 Soares JotalJetus_Nb2HF8_ 

H62570 gb:yr44a01 .r1 Soares fetal liver spleen 

H8 3465 gb:ys91a11.s1 Soaresretina N2b5HR Homo 

AK001527 Hs.163953 hypothetical protein FLJ 10665 

NNL014312 Hs.1 12377 cortic al thymocyte receptor (X. laevis 

T98199 Hs.48403 hypothetical protein FU 10847 

BE299567 Hs.271749 ESTs, Moderately similar to ALU8_HUMAN A 

NMJXJ2666 Hs.103253 periltpin 

BE256206 Hs.17775 p75NTR associated cell death executor, o 

AA485421 Hs.270503 ESTs, Weakly similar to ALU7_HUMAN ALU S 

NM_002250 Hs.10082 potassium Intermediate/small conductance 

R44214 Hs.101189 ESTs 

C16161 Hs.283040 hypothetical protein PR02543 

AA1 93106 Hs.1 808 17 chromosome 1 1 open reading frame 23 

H39537 Hs.75309 eukaryotic translation elongation factor 

AW1 50717 Hs.296176 STAT induced STAT inhibitor 3 

N62889 Hs. 107242 Homo sapiens cDNA FLJ 12965 fis, done H\ 

AA443323 Hs.107812 BPOZ protein 

AA056483 Hs.301463 Human Chromosome 16 BAC clone C1T987SK-A 
NM_013403 Hs.108665 zinedin 

AL1 17472 Hs.108924 SH3-domain protein 5 (ponsln) 

AI146494 Hs.109525 ESTs, Weakry similar to 1RX2_HUMAN IROQU 

U40714 Hs.239307 tyrosyMRNA synthetase 

AA530892 Hs.171695 dual specificity phosphatase 1 

BE617015 Hs. 11006 ESTs, Moderately similar to T 17372 plasm 

AF1 10141 Hs.288908 WAS protein family, member 2 
NM_003877 Hs.110776 STAT induced STAT inhibitor-2 
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129371 X06828 Hs.1 10802 von Wl I leb rand factor 5 

129381 AW245805 Hs.1 10903 ctaudtn 5 (transmembrane protein deleted 10 

129440 W37944 Hs.4007 Sarcolemmal-assodated protein 5 

129441 BE061069 Hs 301943 KIAA0467 protein 10 
5 129516 AF020038 Hs 11223 isocitrate dehydrogenase 1 (NADP+), solu 10 

129554 BE222076 Hs.1 13069 ESTs 10 

129684 BE622468 Hs 11924 ESTs, Weakly simitar to I38022 hypotheti 5 

129702 AI304966 Hs 12035 ESTs, Weakly similar to I38022 hypotheti 5 

129778 AK001676 Hs 12457 hypothetical protein FL J 108 14 10 

10 129893 AK000956 Hs 13209 hypothetical protein RJ 10094 5 

129928 AI338993 Hs 134535 ESTs 5 

129973 AJ251760 Hs 273385 guanine nucleotide binding protein (G pr 5 

129977 NM_000399 Hs 1395 early growth response 2 (Krox-20 (Drosop 5 

130014 NM_001158 Hs 143102 amine oxidase, copper containing 2 (reti 5 

15 130085 M62402 Hs 274313 insulin-like growth factor binding prote 10 

130089 AA452006 Hs 333199 ESTs 5 

130162 W80711 Hs 319946 Homo sapiens mRNA for KIAA 1727 prote in, 5 

130243 D88435 Hs 153227 cyclin G associated kinase 10 

130315 AI241084 Hs 154353 no nselect've sodium potassium/proton ex c 5 

20 130339 AA435746 gb:zt79e03.s1 Soares_testis_NHT Homo sap 5 

130400 V00517 Hs 283108 hemoglobin, gamma G 10 

130436 NM_001928 Hs 155597 D component of complement (adipsin) 10 

130478 X72308 Hs 251526 small inducible cytokine A7 (monocyte ch 5 

130480 BE222978 Hs 15760 MYG1 protein 10 

25 130494 AW390834 Hs 75874 pregnancy-associated ptasma protein A 5 

130563 BE270472 Hs 279900 HSPC01 5 protein 10 

130589 AL1 10226 Hs 16441 DKFZP434H204 protein 10 

130606 AI652143 Hs 288382 hypothetical protein FLJ1 31 11 5 

130634 AI769067 Hs 127824 ESTs, Weakly similar to T28770 hypotheti 3 

30 130683 AA993269 Hs 17872 Homo sapiens, done !MAGE:38750 12, mRNA 10 

130689 NMJJ06691 Hs 17917 extracellular link domain-containing 1 10 

130716 AA232075 Hs 18259 XPA binding protein 1; putative ATP (GTP) 5 

130718 AF263462 Hs 18376 KIAA1 31 9 protein 10 

130722 N41322 Hs 18441 ESTs 5 

35 130798 M81349 Hs 1955 serum amyloid A4, const'tutive 10 

130840 BE048821 Hs 20144 small inducible cytokine subfamily A (Cy 10 

131184 AB040935 Hs 23954 cere bral cell adhesion molecule 10 

131261 AA360419 Hs 171776 inositol(myo)-1(or4)-monophospriatase 1 10 

131282 X03350 Hs4 alcohol dehydrogenase 1B (class l) t beta 10 

40 131328 AW939251 Hs 25647 v-fosFBJ murine osteosarcoma viral onco 10 

131340 AK000393 Hs 25817 BTB (POZ) domain containing 2 5 

131341 AF1 10908 Hs 297660 TNF receptor-associated factor 3 5 
131406 H83294 Hs 284122 Wnt inhibitory fa dor-1 5 
131489 BE394648 Hs 27414 hypothetical protein 5 

45 131543 AW966881 Hs 41639 programmed cell death 2 10 

131692 BE559681 Hs 30736 KIAA01 24 protein 5 

131753 AA829286 Hs 332053 serum amyloid A 1 10 

131756 AA443966 Hs 31595 ESTs 10 

131785 H69342 Hs 26320 TRABID protein 10 

50 131815 AA021258 Hs 32753 ESTs 5 

131819 BE244961 Hs 173103 FE65-LIKE2 5 

131828 AJ000263 Hs 278658 keratin, hair, basic, 6 (monilethrix) 10 

131886 AW294659 Hs.34054 Homo sapiens cDNA: FU22488 fis, clone H 5 

131927 AJ0O3112 Hs 34780 doublecortex; lissencephaly, X-linked (d 5 

55 131949 AK000010 Hs.258798 hypothetical protein FL J 20003 10 

132115 H81604 Hs.178471 KIAA0798 gene product 5 

132177 X80818 Hs 178078 glutamate receptor, metabotropic 4 5 

132296 AA467752 Hs 195161 ESTs 5 

132426 AW118072 Hs 89981 diacylgrycerol kinase, zeta (104kD) 10 

60 132477 S68874 Hs 170917 prostaglandin E receptor 3 (subtype EP3) 5 

132675 AI291496 Hs5476 Homo sapiens, done I MAGE :35301 23, mRNA, 10 

132796 NMJ)06283 Hs 173159 transforming, addic coiled-coil contain 10 

132898 W28548 Hs.224829 ESTs 10 

132905 NM_004235 Hs7934 Kruppel -like factor 4 (gut) 10 

65 132953 BE175645 Hs.321264 LBP protein 32 5 

133116 BE563966 Hs.6529 ESTs, Weakly similar to I78 88 5 serin e/th 5 

133120 NM_003278 Hs.65424 tetranectin (pi asminog en- binding protein 10 
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133139 AF052138 Hs.6580 Homo sapiens cONA: FLJ23227 f:s, done C 

133163 AA668224 Hs.6634 Homo sapiens cDNA: FLJ22547 fa, done H 

133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORK H 

133272 NMJXJ2776 Hs.69423 kallikrein 10 (KLK10) (PRSSL1) (nes1) 
5 1 33379 AA207059 gb:zq80h09.s1 Stratagene hMT neuron (937 

133407 AF017987 Hs.7306 secreted frizzted-related protein 1 

133552 H21497 Hs.7471 BBP-like protein 1 

133702 L02321 Hs.75652 glutathione S-transf erase M5 

133719 H26904 Hs.75736 apolipoprotein D 

10 133731 N71725 Hs.272572 hemoglobin, alpha 2 

133789 T85626 Hs.76239 hypothetical protein FLJ20608 

134007 AF072441 Hs.7840 caldneurin binding protein 1 

134055 D86062 Hs. 182423 ES1 (zebrafish) protein, human homolog o 

134111 AI372588 Hs.8022 TU3Apratein 

15 134117 AA081846 HsJ921 Homo sapiens mRNA; cDNA DKFZp566E1o3 (tr 

134177 BE243319 Hs79672 KIAA0652 gene product 

134308 AW905827 Hs.81454 ketohexokinase (fructokinase) 

134361 BE549343 Hs.82208 acyl-Coenzyme A dehydrogenase, very long 

134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 

20 134449 L34155 Hs.B3450 lamtnin, alpha 3 (nicer (150kD), kalini 
134467 AI190413 Hs.8373 ESTs 

134496 M64936 gb:Homo sapiens retinae actd-indudble 

134510 NM_Q02757 Hs.250870 mitogen-acOvated protein kinase kinase 
134550 M26315 Hs.85258 CDS antigen, alpha polypeptide (p32) 

25 134577 BE244323 Hs.85951 exportin, tRNA (nudear export receptor 
134591 U73394 Hs.166085 ki Her cell immunoglobulin -like receptor 
134678 AL008583 Hs.182595 dynein, axonemal, light polypeptide 4 
134728 D10216 Hs.89394 POU domain, class 1 , transcription facto 
134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, ptas 

30 134786 T2961B Hs.89640 TEK tyrosine kinase, endothelial (venous 
134912 T87521 Hs.261457 ESTs 

134963 NM_003394 Hs.91985 wingless-type MMTV integration site fami 
134969 H22570 Hs.172572 hypothetical protein FL J 2009 3 
135001 AA302517 Hs 92732 KIAA1444 protein 

35 135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 

135173 AL036557 Hs.95910 putative lymphocyte GO/61 switch gene 
135197 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 
135219 AB002361 Hs.96633 KIAA0363 protein 
135250 U83171 Hs.97203 small inducible cytokine subfamily A (Cy 

40 135304 AA416829 Hs.191597 ESTs 

135337 AA905406 Hs.9905 ESTs, Weakly similar to unnamed protein 
135417 X55019 Hs.99975 cholinergic receptor, nicotinic, delta p 
101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 
128870 H39537 Hs.75309 eukaryotic translation elongation fa dor 

45 129381 AW245805 Hs.110903 daudin 5 (transmembrane protein deleted 
130085 M62402 Hs.274313 insulin-like growth factor binding prote 
130689 NM_0O6691 Hs.17917 extracellular link domain-containing 1 
133120 NIVL003278 Hs.65424 tetranectin (plasminogen-binding protein 
133407 AF017987 Hs.7306 secreted frizzted-related protein 1 

50 133731 N71725 Hs.272572 hemoglobin, alpha 2 

134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 
135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) - 
135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 
322580 AK001852 Hs.274151 ligatin 

5 5 408790 AW580227 Hs.47B60 neurotrophic tyrosine kinase, receptor, type 2 
418043 AW377752 Hs.83341 AXL receptor tyrosine kinase 
427458 BE208364 Hs.29283 ESTs, Weakly similar to LKHU proteoglycan link 
446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb 
d49826 U85642 Hs.138506 ESTs 

5Q RC_H15814_s Human apM1 mRNA for GS3109 (novel adipose 

YEL024w/RIP1 EST - YEL024w/RIP1 
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TABLE 1A 



Table 1 A shows the accession numbers for those pkeys lacking unigeneED's for Table 1 . 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accessions 



108446 1 12224J AA085383 AA126091 AA074174 AA075373 AA079120 AA070831 AA075978 AA0753 72 AA 128503 
108497 110079_2 AA074897 AA113914 AA064871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA071453 AA078803 
AA148628 AA122204 AA074159 AA126185 AA079117 AA127089 AA070912 AA079280 AA131372 AA078833 AA071087 
AA076131 AA071047 AA079401 AA083070 AA102076 AA1 15163 AA074198 AA134725 AA1 13889 AA121103 AA075041 
AA065148 AA071310 AA101 144 AA079659 AA078931 AA079209 AA070928 AA068994 M069817 AA076187 AA069053 
AA131489 AA071308 AA063317 AA070156 AA071430 AA076056 AA075684 AA070053 AA126283 AA126078 AA075895 
AA079208 AA074583 AA071086 AA079623 AA070627 AA07B802 AA076622 AA065051 AA079143 AAQ711 10 AA079434 
AA148748 AA079230 AA085188 AA074485 AA070580 AA076151 AA083166 AA0851 18 AA079450 AA085044 AA120938 
AA079200 AA100188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AAO84027 
AA1 15929 

124215 1597154_1 H62570H59063 

1 17058 1 219924.1 AW8018O6 H90434 BE086530 

1 10455 46874.1 H52576 AF085971 H52172 

1 11 168 38585 J AI798376 S464O0 AW81 1617 AW81 1616 WQ0557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA1 73345 AA1 9994 2 AA223384 
AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW8791 88 AW8 13567 AW813538 AI267168 
AA157718AA157719AA100472AA100774AA130756 AA1 57705 AA 1577 30 AA157715 AA053524 AW849581 AW854566 
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 
AW817659 BE081531 H59570 

111498 411008_1 AI168511 AI022712 AA700366 R07371 R07324 

104340 46289J0 AA426 1 89 F 15201 

103747 11 7944 J AA081995 AA101099 

134496 46501.1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41 176 AW594624 BE502415 AA121893 AI269283 T4031 1 

AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 
103750 118365J AA1 261 29 AA 126033 AA082561 

105239 34624J AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 
AW869639 BE061833 BE000620 AW961 170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03O07 AW754298 AA227407 AA113928 AA307904 
C16859 

120379 34624_3 AL042725 BE063316 AW975610 AA457591 BE062092 AI655202 AA714296 AI267264 AI075321 AA223286 AA071122 
AA227849 AA216700 AI696002 AA101867 AA099426 AA135997 AL041698 T02815 T51824 AA207189T59230 T51868 
AA663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856396 AW861859 AA053045 

114624 111686.1 AA081507 AA070071 AA070840 AA084362 

106851 322947 J AI458623 AA639708 AA485409 R22065 AA485570 

108392 113549.1 AA075124 AA075208 

100545 22955J1 M55405 AW752552 

100654 tlgr_KT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 

X51363 X51364 X51365 
100702 tigr.HT3413 L27065 

102208 6735J U22961 AA203623 AA503337 AH74733 AI192802 C06092 AA035357 AI190619 A1199244 AI828450 AA602296 AI378195 
AI209170AI186653 AI127795 AI183846 H77389 AI5894 6 5 M629390 H94306 AI018388 R68584 AA027196 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
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N70806 AI141254 AI383191 AI401237 At030709 AI093400 WB4549T90806 R00012 W01413 AA630o57 AI378348 A 5592 65 
^77103 WM4W 1/^625146 R66379 AI133207 AM329&0 AI133214 AI064826 AI061615 AH33473 AI174852 AI133404 
w£m S Tm^52Tm12523 AI207526 AI1331 20 A1054802 AI174993 AM 14729 AI061645 AI064716 AI064959 H77388 
T85706 AF075298 AH 10799 D17107 NM.0OO477 AF1 90168 R50724 A I2484 1 6 A 1207432 AH 33684 AI133345 AI174710 
AI133290AH33304A1174948 AI207484 AI110717 AF074624 AH14515 AF063516 AI110642 All 14559 AH 14498 AI114759 
AI207568 AI064960 AI174753 AH 14666 R69184 R0001 1 AI064997 T60501 AI207701 T71735 AA365318 H73569 T50496 

T73413T55909 R50261 T72061 N80533 T51189T74936 AI207490 AI132925 AI064701 
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA2M989 T39772 
AI444620T52^0D16931 T40012 T43403 T58926 T691 95 AM 33061 T 5 0850 AI4 00677 AI091 1 36 AA334608 T5741 1 Z20979 
N56507 T87485 AI133622 AA343370 T40075 T69671 T53849 T74820 AF075316 AI1 10618 T4Q121 T57381 AM 14468 
AA332728 T51362 AM 14589 R06691 AM 10629 AF063503 AI140543 AA334661 AA332720 AA343262 T7351 3 T86549 
AM 1484? T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972T39896 All 74786 
Al 132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AW44602 T60996 AH 14792 H9391 1 AM 33106 
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 A1207567 
H71080T73217 AA343950 AM74743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81 661 T73179 R01842 
AA501 730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T5591 8 AI207595 T39951 
AA005016T60361 T69176 T73356 T58795T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AH 14786 
T83564 AA341859 T81684T55769 Al 11471 0 T51 776 AA343213 AM14714 T58102 AI110809 R28984 AM748 54 AA305675 
M343592 T53B36 T46869 T64721 T55508 WQ5241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T7321 1 AI1 14590 T4831 7 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 AI207662 AM92792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AJ913646 
Tfi^962 AI0651 12 AI207689 AI174684 AI207702 T81 475 AM 33325 AI03251 2 AA701 169 A1936354 AH 14720 AI433289 
AA046980 AI823482 AM 14536 AA860651 AW242644 R07469 AW300438 AI13341 6 AW271670 AI991363 T78943 AI823481 
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI1 50977 T62890 T71374 T68294 Al 174774 T6741 1 T68318 
A1064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 ^^^l^^J^ 30 
W52657 T71984 T69118 W92684 AH 1 4860 T62093 T6 1797 AI522333 T73322 H92981 T5601 8 T61 81 1T57232 Al 33^58 
T61821 T69457 T62900 T62912 T72917 T46885 A1702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918 
T59166 AM87111 T64308T62071 T69427 AM14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53S77 T73452 T68852 N75290 1 AI3 2890 
T67751 AI174983T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869T68447 D11809 D12412 T64300 
T28321 T55864 

123941 genbank_AA621529 AA621529 

118049 genbank_N53145 N53145 

102800 14782„20 AA313538 U88895 U88902 

104106 AA422123J_atAA422123J 

1 1 1 738 genbank_R26065 R25065 

113149 genbankJT51588 T51588 

113958 genbank_W86195 W86195 

108335 genbank_AA070500 AA070500 

108351 genbank_AA071193 AA071193 

108441 genbank_AA079079 AA079079 

124276 genbank_H83465 H83465 

101447 entrez_M21305 M21305 

117226 genbank_N20468 N20468 

133379 genbank_AA207059 AA207059 M207241 

119366 genbankJ77892 T77892 

119528 NOT_FOUND_entrez_W38051 W38051 

112588 genbank_R77302 R77302 

114449 genbank_AA020736 AA020736 

114576 genbank_AA065096 AA065096 

107459 W38002_s_at W38002_s 

130339 genbank_AA435746 AA435746 
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TABLE 2: Figure 2 from BRCA 001 US 



Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn; Exemplar Accession number, Genbank accession n umber 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R 1 : Ratio of normal breast tissue to tumor 



Pkey 


ExAccn 


UnigenelD 


Unigene Tittle 


R1 


100499 


T51986 


HS.2S31G8 


hemoglobin, gamma G 


10 


100549 


BE142019 


Hs.222056 


Homo sapiens cDNA FLJ1 1572 fis, clone HE 


10 


100654 


A03758 




NM_000477*:Homo sapiens albumin (ALB), m 


10 


100971 


BE 379727 


Hs 83213 


fatty acid binding protein 4, adipocyte 


10 


101184 


NM 001674 


Hs.460 


activating transcription factor 3 


10 


101336 


NM 006732 


Hs 75678 


FBJ murine osteosarcoma viral oncogene h 


10 


101367 


X03350 


Hs4 


alcohol dehydrogenase 1B (class I), beta 


10 


101447 


M21305 




gbJHuman alpha satellite and satellite 3 


10 


101461 


N98569 


Hs 76422 


phospholipase A2, group HA (platelets, 


10 


101511 


M27826 


Hs 267319 


endogenous retroviral protease 


10 


101736 


M74447 


Hs502 


transporter 2, ATP-binding cassette, sub-famit 


10 


102208 


U22961 




gMHuman mRNA clone with similarity to L 


10 


102450 


U48251 


Hs 75871 


protein kinase C binding protein 1 


10 


102800 


AA313538 




gb:EST185419 Colon carcinoma (HCC) cell 


10 


102857 


NM 006744 


Hs 76461 


retinol-binding protein 4, interstitial 


10 


102990 


AA829286 


Hs 332053 


serum amyloid A1 


10 


103747 


AA081995 




gb:zn26d06.r1 Stratagene neuroepithelium 


10 


103812 


AA137107 


Hs.326391 


Homo sapiens, clone MGC:16638, mRNA, com 


10 


104093 


R50727 


Hs.336970 


ESTs 


10 


104109 


AL3 53957 


Hs 284181 


hypothetical protein DKFZp434P0531 


10 


104250 


F06638 


Hs.12440 


Homo sapiens clone 24734 mRNA sequence 


10 


104492 


N73185 


Hs 94285 


EST 


10 


104506 


N91071 


Hs.109650 


ESTs 


10 


104532 


AI498763 


Hs 203013 


hypothetical protein FIJI 2748 


10 


104677 


AA0O9764 


Hs 190380 


ESTs 


10 


104711 


AA017245 


Hs.32794 


ESTs 


10 


104731 


AA019300 


Hs.125070 


ESTs, Moderately similar to 154374 gene 


10 


105005 


AI298208 


Hs.28805 


ESTs 


10 


105036 


AA1 30390 


Hs 25549 


hypothetical protein FLJ20898 


10 


105239 


AA221036 




gb;zr03f12.r1 Stratagene NT2 neuronal pr 


10 


106052 


N79885 


Hs.6382 


ESTs, Highly similar to T00391 hypotheti 


10 


106181 


A 1803651 


Hs 191608 


ESTs 


10 


106283 


AI085846 


Hs 25522 


KIAA1808 protein 


10 


106379 


AL042069 


Hs 119021 


DKF2P434N061 protein 


10 


106451 


AW235928 


Hs 313182 


ESTs 


10 


106491 


AA135688 


Hs 10083 


Homo sapiens, clone IMAGE:4139786, mRNA, 


10 


106782 


AW054886 


Hs.25682 


Homo sapiens mRNA for KIAA1863 protein, 


10 


107124 


AB006532 


Hs.31442 


RecQ protein-like 4 


10 


107148 


AI005036 


Hs 334305 


GS1999full 


10 


107214 


AF127026 


Hs.5394 


myosin IA 


10 


107242 


AB020672 


Hs 175411 


K1AA0865 protein 


10 


107331 


AI905985 


Hs 111805 


ESTs 


10 


107447 


W28516 


Hs 19210 


hypothetical protein MGC11308 


10 


107451 


AL042425 


Hs 283976 


hypthetical protein PR02389 


10 


107872 


BE271708 


Hs 95110 


ESTs, Weakly similar to A55943 1-phospha 


10 


108351 


AA071193 




gb:zf79b12.s1 Soares_pheaLgland_N3HPG 


10 


109546 


F01449 


Hs 26954 


Homo sapiens mRNA; cDNA DKFZp762G123 (fr 


10 


110433 


AW294162 


Hs 301062 


UDP-N-acetyl-a)pha-D-galactosamine:polyp 


10 


110976 


AL044174 


Hs 159526 


patched (Drosophila) homolog 


5 
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1 1 1 168 AI798376 gb:tr34b07J<1 NCI_CGAP_Ov23 Homo sapiens 

111651 R15733 Hs.20499 ESTs 

111803 AA593731 Hs.325823 ESTs, Moderately simlar to ALU5JHUMAN A 

114484 AA034378 Hs.?67319 endogenous retroviral protease 

5 125284 NMJ302666 Hs.1 03253 perilipln 

128850 AA193106 Hs.180817 chromosome 11 open reading frame 23 

128903 AW150717 Hs296176 STAT induced STAT inhibitor 3 

129346 AF1 10141 Hs.288908 WAS protein family, member 2 

129381 AW245805 Hs.1 10903 claudin 5 (transmembrane protein deleted 

10 129516 AF020038 Hs.1 1223 isocitrate dehydrogenase 1 (NADP+), solu 

129554 BE222078 Hs.1 13069 ESTs 

130085 M62402 Hs 274313 insuMhe growth factor binding prate 

130243 D88435 Hs.153227 eye I'm G associated kinase 

130400 V00517 Hs.283108 hemoglobin, gamma G 

15 130436 NM 001928 Hs.155597 D component of complement (ad:ps;n) 

130563 BE270472 Hs.279900 HSPC0 15 protein 

130589 AL110226 Hs.16441 DKFZP434H204 protein 

130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE:3875012 t mRNA 

130689 NMJJ06691 Hs.17917 extracellular link domain-containing 1 

20 130689 AA046747 Hs.17917 extracellular link domain-containing 1 

130718 N70196 Hs 18376 KIAA1 31 9 protein 

130798 M81349 Hs.1955 serum amyloid A4, constitutive 

130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 

131184 AB040935 Hs.23954 cerebral cell adhesion molecule 

25 131282 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 

131328 AW939251 Hs 25647 v-fos FBJ murine osteosarcoma viral onco 

131543 AW966881 Hs 41639 programmed cell death 2 

131753 AA829286 Hs 332053 serum amyloid A1 

131785 H69342 Hs 26320 TRAB ID protein 

30 131828 AJ000263 Hs 278658 keratin, hair, basic, 6 (monilethrix) 

132426 AW118072 Hs 89981 diacylgrycerol kinase, zeta (104kD) 

132675 AI291496 Hs 5476 Homo sapiens, clone IMAGE3530123, mRNA, 

132898 W28548 Hs 224829 ESTs 

132905 NM 004235 Hs 7934 KruppeUike factor 4 (gut) 

35 133120 NNL003278 Hs 65424 tetranectin (plasm'mogen-binding protein 

133407 AF017987 Hs 7306 secreted frizzle d- related protein 1 

133719 H26904 Hs 75736 apolipoprotein D 

134007 AF072441 Hs 7840 calcineurin binding protein 1 
134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o 
40 134111 AI372588 Hs8022 TU3 A protein 

134117 AA081846 Hs7921 Homo sapiens mRNA; cDNA DKFZp 566 E 183 (fr 
134177 BE243319 Hs 79672 K1AA06 52 gene product 
134369 AF207664 Hs8230 a disintegrin-like and metalloprotease ( 
1 34496 M64936 gb:Homo sapiens retinoic acid-inducible 

45 134510 NM_002757 Hs.250870 mHogen^ctivated protein kinase kinase 
134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 
134758 NM000078 Hs.89538 cholestery! ester transfer protein, pias 
134963 NM_003394 Hs.91985 wingless-type MMTV integration site fa mi 
135066 X04430 Hs.93913 interieukin 6 (interferon, beta 2) 
50 408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 
446674 AA563892 Hs.306000 solute carrier famly 4 (anion exchanger), memb 



10 

10 

10 

10 

10 

5 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

5 

5 

10 

10 

10 

10 

5 

5 

10 
10 
10 
10 
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TABLE 2A 
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Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



111168 38585J AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AWB58231 AW857541 AW814 172 H56214 AW814398 AF134154 AA243093 AA173345 AA199942 AA223384 
20 AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 

AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA1 5771 5 AAO 53524 AW849581 AW854566 C05254 
AWB82836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW1 95620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 
25 103747 117944.1 AA08 1 995 AA1 01099 

134496 46501J M64936 AI025512 AI382987 BE061777 AA089966 BE1 69930 T41176 AW594624 BE502415 AA1 21893 Al 269283 T4 03 11 

AI684569AA257011 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 
105239 34624J AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 
AW869639 BE061833 BE000620 AW961 170 AW84751 9 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
3 0 AW821667 AA055720 AW817981 AW856468 AA 15571 9 AA 179928 T03007 AW754298 AA227407 AA1 13928 AA307904 

C16859 

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 
X51353 X51364 X51365 

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 Al 19061 9 Al 199244 AI828450 AA602296 AI378195 

3 5 AI2C9170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 

AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI04251 2 AI383957 AA156873 AI333101 
N70306 AI141254 AI383191 AI401237 AI080709AI093400 W84549T90806 R00012 W01413 AA630557 AI378348 AI559265 
M877103 W84464 AA625146 R68379 AM 33207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AH14729 AI061645 AI064716 AI064959 H77388 

40 T85706 AF075298AI110799 D17107 NM_000477 AF190168 R50724 A1248416 AI207432 AI133684 AI133345 AI17471 0 

AI133290 AI133304 Al 1 74948 AI207484 AI1 10717 AF074624 AM 14515 AF063516A11 10642 A11 14559 A1114498 AI114759 
A1207568 AI064960 AI174753AI1 14666 R69184 R00011 Al 064997 T60501 AI207701 T71735 AA385318 H73569T60496 
H94399 AI133158 T74675 AA484750 T73413T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701 
AI174748 AH 14663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 

45 AI444620T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979 

N56507 T87485A1133622 AA343370 T40075T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 A1114468 
AA332728 T51362 A1114589 R06691 A11 10629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 
AH 14840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972T39896 AI174786 
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 Ai444602 T60996 AI114792 H9391 1 AI133106 

50 R10779 AI06502O T90925 T50889 D17029 AI133703 AA333805 AI133040 AH33017 AI064857 AI110730 AF074637 AI207567 

H71060 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 A1114676 AI064778 AAO35710 W52763 AI114786 
T83564 AA341859 T81684 T55769 AM 14710 T51776 AA343213 AI114714 T58102 AI110809 R28984 A1174854 AA305675 

55 AA343592 T53836T46869T64721 T55508 W05241 T54019 T57945 T60513T48364 AF075308 W86731 T82851 T48269 

H54053 T73211 AI1 14590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI1331 17 AI766232 AI913646 
T83962 AI0651 12 AI207689 AI174684 AI207702 T81475 AM 33325 AI032512 AA701 169 AI936354 AH 14720 AI433289 
AA046980AI823482 A1114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481 

60 AA845518 AA719124 AA883454 T68850 T691 15 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318 

AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI06481 9 AI205880 T62895 T69430 T951 1 1 AA025050 T73330 
W52657 T71984 T691 18 W92684AI1 14860 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 T57232 AI336158 
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T61821 T69457 T62900T62912 T72917 T46885 A1702448 T57212 T57203 R94581 T7131 1 ™«^™M^B7708^0918 
T59166 AH871 1 1 T64308 T52071 T69427 AI1 14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
ImwraTHBM TM273 T7M39 AW1 94909 T75486 T71850 T71305 T71287 T53877 T73452 T56852 N75290 AI312890 
SSI T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D1 1809 D12412 T6430O 
T28321 T55864 

102800 14782_20 AA313538 U88895 U86902 

108351 genbankJ\A071193 AA071193 

101447 entrez.M21305 M21305 
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TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downrcgulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R 1 : Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigenelD UnigeneTitle R1 

101336 NfVL006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10.0 

102208 U22961 gb:Human mRNA clone with similarity to L 100 

102990 AA829286 Hs.332053 serum amyloid A 1 100 

111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 100 

111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10 0 

130085 M62402 Hs.2743 13 insulin-like growth factor binding prote 100 

130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10 0 

131543 AW966881 Hs.41639 programmed cell death 2 10 0 

133120 NM_003278 Hs.65424 tetranectin (plasminogen-bind'ing protein 10 0 

134758 NMJXJ007B Hs.89538 cholesteryl ester transfer protein, plas 10.0 
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TABLE 3A 

Table 3A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



Ptey. Unique Eos probeset identifier number 

CAT number. Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

nnRfl 1 AI7qR17BS464O0AW811617AW811616W00557BE142245AW858232AW861851 AW858362 AA232351 AA21 8567 

111168 38585.1 4^555^ AW8582^1 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 

itS AaSS ^74151139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
S M100472 AA100774AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
^Sm^m^U^mMOm BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
JXfWOSi S»Ta AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 

1 UM961 MxSa503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 
1022086735 J g^^™ * H77389 AI589465 AA629390 H94306 AI018388 R68584 M027196 AI74M13 A 8M92 
S A 623873 A 0 0 N50096 M047486 N25060 AA327614 AKM2512 AI383957 AA156873 A1333101 N70806 AIU1254 
A 383191 A401237 AI080709 AI093400 W84549 T90806 ROC012 W01413 AA630557 AI378348 AI559255 AA8771 03 M^64 
Su6^mMmmmS9K AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 AI133272 V00494 M12523 

D17107NM 000477 AF190168 R50724 AI248416 AK07432 AM33684 AI133345 AI174710 AI133290 AI133304 AI174948 
™7484« 

A 114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569T60496 H94 398 i A ^» 
T73413T55909 R50261 T72061 N80533 T51 189 T74936 A1207490 AI132925 AI064701 AI174748 ^1114^^1M AI132999 
A 111100 AI064925 AB64979 AI133063 AA343347 T69091 AA233989 T39772 AI444620 T52290 D16931 T40012 T48403 

T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468 AA332728 T51362 AI114589 R06691 AM 10629 
So3 AM«543 M334661 AA332720 AA343262 T73513 T86549 AI1 14840 T57284 T39981 T61407 T72757 T5 749 
TM6»^31&T72WR941M T83028 T39972 T39896 AI174786 AI1 32926 R09237 AI064838 AI133660 T60398 T88753 
SSAwV46ofi 

AI133040 AI133017 AI064857AI1 10730 AF074637 AI207567 H71080 T73217 AA343950 AI174743 AA334224 AA334281 

T40163^628 T81661 T73179 R01842 AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 

aSSIowS 

A110809 R28984 AI174854 AA305675 AA343592 T53836 T46869T64721 T55508 ^^19^^3741864 
AF075308 W86731 T82851 T48269 H54053 T73211 A1114590 T48317 T55965 T74857 R84226 T56552 T5223 TO946 T76976 
S T95666 i 203974 AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 
Am31 17 AI7662TaI91^ A1174584 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 

A 114720 AM33289 M&^sVw823^2W 114536 AA860651 AW242644 R07469 AW300438 AI133416 
T78943 AI823481 AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AM 74774 
« S AI064189 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI20588C ) T628S I TW30 T95111 
' AA025050 T73330 W52657 T71984 T691 18 W92684 All 14860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 
^^^^^T(^tS^2 T72917 T46885 AI702448 T57212 T57203 R94581 T7131 1 T61819 T89358 

T69394 AI207729 T55839 T90273 T73339 AW194909 T75486 T71 850 T71305 T71287 T53877 T73452 T68852 N75290 
AI312890 T67751 AJ174963 T 5 1679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D1 1809 D12412 
T54300 T28321 T55864 
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TABLE 4: Figure 4 from BRCA 001 US 



Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey; Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of turner to normal breast tissue 



Pkey ExAccn UnigenelD Unigene Title R1 

100113 NM_001269Hs 84746 chromosome condensation 1 2 3 

100114 X02303 Hs 82962 thymidylate synthetase 2 9 
100131 D12485 Hs 11951 ectonucleotide pyrophesphatase/phosptodiesterase 1 19 

100146 BE185499 Hs 2471 KIAA0020 gene product 1 $ 

100147 D13666 Hs 136348 osteoblast specific factor 2 (fasciclin Mike) (periostin) 7 5 
100154 H60720 Hs 81892 KIAA01 01 gene product 92 
100163 W44671 Hs 124 gene predicted from cDNA with a complete coding sequence 1-6 
100220 AW015534 Hs 217493 annexinA2 2 0 
100265 D38521 Hs 112396 KIAA0077 protein 15 
100271 BE160081 Hs 256290 S1 00 calcium-binding protein A11 (calgizzarin) 13.5 
100275 BE242802 Hs 154797 K I AA0090 protein 51 
100323 D50920 Hs 23106 KIAA01 30 gene product 19 
1X335 AW247529 Hs.6793 platelet-activating factor acetylhydrolase, isoform lb, gamma subunit (29kD) 2 7 
100364 NM_004341Hs 154868 carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase 2 0 
100372 NM_01 4791 Hs 184339 KIAA0175 gene product 2 6 
100393 D84145 Hs.39913 novel RGD-containing protein 3 -2 
100400 AW954324 Hs.75790 phosphatidylinositol glycan, class C 15 
100418 D86978 Hs 84790 KIAA0 225 protein 20 
100482 M65028 Hs 81361 heterogeneous nuclear ribonucleoprotein A/B 2 9 
100518 NM_004415Hs74316 desmoplakin (DPI, DPIl) 19 

100666 L05424 Hs 169610 CD44 antigen (homing function and Indian biood group system) 57 

100667 L05424 Hs 169610 CD44 antigen (homing function and Indian blood group system) 9 0 

100668 L05424 Hs 169610 CD44 antigen (homing function and Indian blood group system) 7 6 
100678 AW502935 Hs 740 PTK2 protein tyrosine kinase 2 53.2 
100685 AA328229 Hs 184 582 ribosomal protein L24 18 
100690 AA383256 Hs1657 estrogen receptor 1 16 
100783 AF078847 Hs 191356 general transcription factor IIH, polypeptide 2 (44kD subunit) 5 9 
100850 AA836472 Hs 297939 cathepsin B 17 
100892 BE245294 Hs 180789 S164 protein 17 
100945 AF002225 Hs. 180686 ubiquitin protein ligase E3A (human papilloma virus E6-associated protein, Angelman syndrome) 1 5 
100969 AA 157634 Hs.79172 solute carrier family 25 (mitochondria I earner; adenine nucleotide transbcator), member 5 6 3 
100988 AK000405 Hs76480 ubiquitin-like 4 m 
100999 H38765 Hs 80706 diaphorase (NADH/NADPH) (cytochrome b-5 reductase) 16 
101031 J05070 Hs 151738 matrix metalbproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase) 82 
101045 J05614 gb:Human proliferating cell nuclear antigen (PCNA) gene, promoter region. 5 0 
101077 N99692 Hs75227 Empirically selected from AFFX single probeset 2 6 
101093 L06419 Hs75093 procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehlers-Danlos syndrome type VI) 1.4 
101161 NM_006262Hs 37044 peripherin 16.9 
101186 AA020956 Hs.179881 core-binding factor, beta subunit 2 ,0 
101216 AA284166 Hs.841 13 eye lin-de pendent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase) 1 8 
101228 AA333387 Hs 82916 chaperonin containing TCP1, subunit 6A (zeta 1) 17 
101247 AA 132666 Hs 78802 glycogen synthase kinase 3 beta 19 
101249 L 18964 Hs.1904 protein kinase C, iota 15 
101332 J04088 Hs 156346 topoisomerase (DNA) II alpha (170kD) 5 2 
101332 J04088 Hs 156346 topoisomerase (DNA) II alpha (170kD) 3 4 
101352 AI494299 Hs.16297 COX1 7 (yeast) homolog, cytochrome c oxidase assembly protein 6 3 
101396 BE267931 Hs 78996 proliferating cell nuclear antigen 4 2 
101445 M21259 gb:Human Alu repeats in the region 5' to the small nuclear rib 1.9 
101470 NM_000546Hs. 1846 tumor protein p53 (Li-Fraumeni syndrome) 1.6 
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101478 NM_00289OHs.758 RAS p21 protein activator (GTPase activating protein) 1 2.5 

101483 M24486 Hs.76768 procollagen-proline, 2^xoglutarate 4-dioxygenase (proline 4 -hydroxylase), alpha polypeptide I b.b 

101540 J04977 Hs.84981 X-ray repair commenting defective repair in Chinese hamster cells 5 (double-strand-break rejotning^.i 

101573 AW248421 Hs,250758 proteasome (prosome, macropain) 26S subunit, ATPase, 3 ™ 



101580 NMJ)12151Hs.83363 coagulation factor Vlil-assodated (intronic transcript) 

101592 AF064853 Hs 91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 l a 

101592 AF064853 Hs 91299 guanine nucleotide binding protein (G pr ™ 

101621 BE391804 Hs 62661 guanyiate binding protein 1, interferon-inducible, 67kD ^ 

101702 AW504089 Hs 179574 protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha eoform u 

101734 M74099 Hs 147049 cut (Drosophtta)-like 1 (CCAAT displacement protein) ^ 

101759 M80244 Hs.184601 solute carrief family 7 (cationic amino acid transporter, y+system), member 5 w 



101767 M81057 Hs, 1 80884 carboxypeptidase B1 (tissue) ™„ 4 

101782 AA306495 Hs.1869 phosphoglucomutase 1 ■ 

101805 AW409747 Hs.75612 stress-in duced-phosphoprotein 1 (Hsp70/Hsp90-organtzing protein) o.o 

101806 AA586894 Hs.112408 S1 00 calctum-binding protein A7 (psoriasin 1) « 
101810 NM_000318Hs.180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) 

101879 AA176374 Hs.243886 nuclear autoantigenic sperm protein (histone-binding) ' b 

101911 AA441787 H s. 11 9689 glycoprotein hormones, alpha polypeptide ^ 
101920 AF182645 Hs.8024 IK cytokine, down -regulator of HLA II 1( „ IK1 . T1 ° 
101973 U41514 Hs.80120 UDP-N-acetyi^lpha-D^galactosamine:porypeptjde N-acetylga!actosaminyltransferase 1 (GalNAc-T1) 2.4 

101983 AI904232 Hs.75323 prohibitin ' 
102009 BE245149 Hs.82643 protein tyrosine kinase 9 

102036 BE250127 Hs.82906 C DC 20 (eel I division cycte 20, S.ce rev isiae, homo log) ^ 

102083 T35901 Hs.75117 interteukin enhancer binding factor 2, 45kD ]■» 

102083 T35901 Hs.75117 interteukin enhancer binding factor 2, 4 ™ 

102107 BE258602 Hs.182366 heat shock protein 75 . 

102123 NM„001809Hs.l594 centromere protein A (17kD) } ° 

102165 BE313280 Hs.159627 death associated protein 3 ™ 

102198 AW950852 Hs.74598 polymerase (DNA directed) , delta 2, regulatory subu nit (50kD) ™ 

102217 AA829978 Hs.301613 JTV1 gene * 

102220 U24389 Hs.65436 lysosomal ' 
102234 AW163390 Hs.278554 heterochromatin-like protein 1 

102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, importin alpha 1) ™ 

102302 AA306342 Hs.69171 protein kinase C-fike 2 J 
102330 BE298063 Hs.77254 chromobox homoJog 1 (Drosophlla HP1 beta) 

102339 BE378432 Hs.95577 cyclin-dependent kinase 4 i 6 

102346 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member B2 ^ 

102349 AU077055 Hs.289107 baculoviral IAP repeat-containing 2 ^ 
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 
102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 

102391 AA296874 Hs.77494 deoxyguanosine kinase ■ 
102455 U48705 Hs.75562 discoid in domain receptor family, member 1 

102465 NM_001359Hs.81548 2,4-dienoyl CoA reductase 1, mitochondnal '° 

102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1, 59kD y> 

102489 AL080116 Hs.74420 origin recognition complex, subunit 3 (yeast homolog)-like ^ 
102494 AI188137 Hs.75193 COP9homolog , tl 
102501 AF217197 Hs.74562 siah binding protein 1; FBP interacting repressor, pyrtmidine tract binding spliang u 
102522 BE250944 Hs.1 83556 solute carrier family 1 (neutral amino acid transporter), member 5 j-o 
102532 AFO40253 Hs.70186 suppressor of Ty (S.cerevisiae) 5 homolog ^ 
102564 U59423 Hs.79067 MAD (mothers against decapentaplegic, Drosophlla) homolog 1 ^ 
102568 W81489 Hs. 223025 RAB31 , member RAS oncogene family 

102580 U6O808 Hs.152981 CDP-diacylgrycerol synthase (phosphatidate cytidylyttransferase) 1 ^ 

102581 AU077228 Hs.77256 enhancer of zeste (Drosophlla) homolog 2 

102582 U61232 Hs.32675 tubulrn-specifrc chaperone e *■] 

102617 AW161453 Hs.198767 COP9 (constitutive prK)kxrK)rphogenic t Arabidopsis, homolog) su bun rt 5 i.o 

102618 AL037672 Hs.81071 extracellular matrix protein 1 ^ 
102627 AL021918 Hs. 158 174 zinc finger protein 184 (Kruppel-like) ^ 
102663 NM_002270Hs.1 68075 karyopherin (importin) beta 2 J° 
102676 BE 262989 Hs. 12045 putative protein y 
102687 NM_007019Hs.93002 ubiqurtin carrier protein E2-C ™ 
102689 U96132 Hs. 171 280 hydroxy acyl-Coenzyrne A dehydrogenase, type II ^ 
102696 BE540274 Hs.239 fork he ad box M1 *J 

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 ™ 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cys-Cys), member 19 ^ 
102750 AB014460 Hs.66196 nth (E.coliendonuclease l!l)-like 1 ^ 
102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinase 

102812 U90549 Hs.236774 high-mobility group (nonhistone chromosomal) protein 17-ltke 3 1b 
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102827 BE244588 Hs.6456 ch a peron in containing TCP 1, subunil 2 (beta) 5.6 

102831 AA262170 Hs.80917 adaptor-related protein complex 3, sigma 1 subunit 2.0 

102844 AV653790 Hs.324275 WW domain-containing protein 1 1.3 

102868 X02419 Hs.77274 plasminogen activator, urokinase 4.4 

102925 BE440142 Hs.2943 signal recognition particle 19kD 1.9 

102935 BE561850 Hs.80506 small nuclear ribonucleoprotein polypeptide A' 2.4 

102968 AU076611 Hs.154672 methylene tetrahydrofolate dehydrogenase (NAD+ dependent), methenyltetrahydrofolate cyclohydrolase2.7 

102983 BE387202 Hs.1 18638 n on- metastatic cells 1, protein (NM23A) expressed in 3.1 

102985 U95742 Hs.2707 G1 to S phase transition 1 5 2 

103023 AW500470 Hs.1 17950 multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 1 6 

103038 M926960 Hs.334883 C DC 28 protein kinase 1 2 5 

103060 NM_OO5940Hs.155324 matrix metalio proteinase 11 (MMP11; stromelysin 3) 4 5 

103080 AU077231 Hs.82932 cyclin D1 (PRAD1 parathyroid adenomatosis 1) 31 

103089 D31152 Hs.1 79729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasia) 2 4 

103177 BE244377 Hs.48876 farnesyl-di phosphate famesyltransferase 1 3 5 

103178 AA205475 Hs 275865 ribosomal protein S1 8 9 9 

103179 NM_001777Hs 82685 CD47 antigen (Rh-related antigen, integrin-assodated signal transducer) 13 
103181 X69636 Hs 334731 Homo sapiens, clone IMAGE: 3448306, mRNA, partial cds 2 0 
103185 NM_006825Hs.74368 transmembrane protein (63kD), endoplasmic reticulum/Golgi intermediate compartment 1 6 
103191 AA401039 Hs.2903 protein phosphatase 4 (formerly X), catalytic subunit 2 5 

103193 NM_004766Hs 75724 coatomer protein complex, subunit beta 2 (beta prime) 2.2 

103194 NM_004939Hs 78580 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 6 3 
103206 X72755 Hs 77367 monokine induced by gamma interferon 8 8 
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (gamma) 3 0 
103232 X75962 Hs 129780 tumor necrosis factor receptor superfamily, member 4 1 8 
103238 AI369285 Hs.75189 death-associated protein 5 6 
103297 NM_001545Hs 9078 immature colon carcinoma transcript 1 19 
103330 AI803447 Hs 77496 small nuclear ribonucleoprotein poiy peptide G 2.5 
103349 X89059 gbHsapiens mRNA for unknown protein expressed in macrophage 16 
103376 AL036166 Hs 323378 coated vesicle membrane protein 18 

103391 X94453 Hs 114366 pyrrol ine-5-carboxylate synthetase (gluta mate gam ma - se mialdeh y de syn thetase) 2.3 

103392 X94563 g bR sapiens dbi/acbp gene exon 1 & 2. 4 0 
103430 BE564090 Hs 20716 translocase of inner mitochondria! membrane 17 (yeast) homolog A 13 
103491 AF264750 Hs 288971 myeloid/lymphoid or mixed-lineage leukemia 3 5 6 
103505 AL031224 Hs.33102 transcription factor AP-2 beta (activating enhancer-binding protein 2 beta) 5.1 
103547 A1376722 Hs 1 80062 proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7) 9.7 
103588 NM_0O6218Hs 85701 phosphoinosrtide-3-kinase, catalytic, alpha polypeptide 2 0 
103613 NMJX)0346Hs 2316 SRY (sex determining region Y)-box 9 (campomelic dysplasia, autosomal sex-reversal) 1 3 

103621 BE379766 Hs 150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 2 0 

103622 AA609685 Hs 278672 membrane component chromosome 11, surface marker 1 2 3 
103727 AI878883 Hs 296381 growth factor receptor-bound protein 2 13 
103749 AL135301 Hs8768 hypothetical protein FU 10849 1 8 
103754 AI0157O9 Hs 172089 Homo sapiens mRNA; cDNA DKFZp586l 2022 (from clone DKF2p586l2022) 1.3 
103780 AA094752 Hs 169992 hypothetical 43.2 Kd protein 7.5 
103795 H26531 Hs7367 Homo sapiens BTB domain protein (BDPL) mRNA, partial cds 1.2 
103797 AA080912 gb:znO4d03.r1 Stratagene hNT neuron (937233) Homo sapiens cONA clone 5' similar 1 5 
103813 AI042582 Hs 181271 CGI- 120 protein 15 
103855 W02363 Hs.302267 hypothetical protein FU10330 1-5 
103886 AK001278 Hs 105737 hypothetical protein FLJ10416 similar to constitutive photomorphogenlc protein 1 6 5 
104052 NM_002407Hs 97644 mammaglobin 2 2.9 
104079 AA251242 Hs 103238 ESTs 14 
104174 AA478984 Hs.6451 PRO0659 protein 5 6 
104227 AB002343 Hs 98938 protocadherin alpha 9 16 
104275 A1751970 Hs 101067 GCN5 (general control of amino-acid synthesis, yeast, homoiog)-iike 2 5 4 
104325 BE379766 Hs 150675 polymerase (RNA) II (DNA directed) polypeptide K (7 0kD) 6 3 
104370 AA324597 Hs 21851 Homo sapiens cDNA FLJ12900 fc, clone NT2RP2004321 1 6 
104423 R83113 Hs.1432 protein kinase C substrate 80K-H 5 2 
104482 AB037762 Hs.44268 myelin gene expression factor 2 12 
104532 A1498763 Hs.203013 hypothetical protein FLJ1 2748 2 1 
104563 AL117403 Hs.306189 DKFZP434F 1735 protein 12 
104667 AI239923 Hs.30O98 ESTs 13 
104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily I, member 6 2 3 
104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H. sapiens] 13 
104806 AB023175 Hs.22982 KIAA0958 protein 2 3 
104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 10.9 
104846 AI250789 Hs.32478 ESTs 5 6 
104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kinase- 1 12.3 
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104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especially RAP-2A 

104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp56402364 (from clone DKFZp56402364) 



2.0 
1.3 

104896 AW015318 Hs.23165 ESTs 1 7 Q 7 
104909 AW408164 Hs.249184 transcription factor 19 (SC1) * ? 
104916 AW958157 Hs.155489 NS1 -associated protein 1 ■ 
104919 AA026880 Hs25252 prolactin receptor ' 
104930 AF043467 Hs.32893 neurexophilin 2 ' 

104973 NM_015310Hs.6753 KIAA0942 protein ™ 

104974 Y12059 Hs.278675 bromodomain-containing 4 

104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 

104975 AL1 36877 Hs 50758 SMC4 (structural maintenance of chromoso 1 J 

104978 AlX [te.19322 Homo Udens. Similar to RIKEN cDNA 2010317E24 gene, ctone IMAGEI3502019, mRNA. partal cds 75 

104979 AA937934 Hs.321062 ESTs 3 5 
104994 AI499930 Hs.334885 mitochondrial GTP binding protein 

105009 BE379584 Hs 34789 oolicnyWiphosph^ " 
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 

105028 AI050715 Hs.2331 E2F transcription tactor5, p130binding iyAr _,, 1A7n , ,o 

105032 AA127818 gb.-zll2a02.s1 Soares_pregnanLuterus„NbHPU Homo sapens cDNA clone IMAGE501674 3 6.8 

105039 AA907305 Hs.36475 ESTs 22 
105041 AB037716 Hs.26204 K1AA1295 protein 
105045 BE242899 Hs 129951 speckle -type POZ protein 
105079 AA151342 Hs 12677 CGI-1 47 protein 

105087 AA147884 Hs.9812 Homo sapiens cDNABJ 14388 fis, done HEMBA 10027 16 

105088 H58589 Hs 35156 Homo sapiens cDNA FU 11027 fts, clone PLACE 10041 14 ^ 
105095 Z78407 Hs.27023 vesicle transport-related protein 

105110 BE387350 Hs.33122 K1AA1 160 protein * 

105126 AW975433 Hs.36288 ESTs . t ^ t * K 7 1 

105127 AA045648 Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type mow 5 . ' 
105141 AA164687 Hs.177576 rmmnosyl (alpta-W^^ 2.7 
105158 AW976357 Hs.234545 hypothetical protein NUF2R ■ 
105169 BE245294 Hs.180789 S 164 protein ^ <n4 „, fi 4R 
105186 AA191512 Hs.28005 Homo sapiens cDNA FU1 1309 fis, clone PLACE 1010076 

105254 AA071276 Hs.19469 K I AA0859 protein ' 
105281 AA263143 Hs.24596 RAD51 -interacting protein 1Q 
105288 N99673 Hs.3565 ESTs, Weakly similar to AF126743 1 DNAJ domain-contain rg protein MCJ [H.sapiens] 1 .» 

105302 AA700122 Hs.3355 sentrin -specific protease ■ 
105331 AW270037 Hs.179507 KIAA0779 protein ' 2 
105359 NM_016015Hs.8054 CGI -68 protein 50 
105366 BE264645 Hs282093 hypothetical protein FLJ2 191 8 • 

105373 AW887701 Hs.32356 hypothetical protein FLJ20628 • 

105374 BE242803 Hs. 262823 hypothetical protein FU 10326 2 3 
105387 AW592146 Hs.108636 membrane protein CH1 ' 
105393 AF167570 Hs.256583 interleukin enhancer binding factor 3,90 kD ■ 
105399 BE386877 Hs.334811 N pw38 -binding protein NpwBP 16 

IK 2 HS ' 65648 gtol'^aslNCIjWAP^CBI Homo sapiens cDNA clone IMAGE:585026 3', mRNA sequence. 5.0 
105507 BE268348 Hs,226318 CCR4-NOT transcription complex, subunit 7 ■ 

105529 AA1 13449 Hs.32471 hypothetical protein FU20364 . 

105530 AB023179 Hs.9059 K1AA0962 protein g3 
105547 AA262640 Hs.27445 unknown 

105564 BE616694 Hs.288042 hypothetical protein FU 14299 ' 

105596 AA579535 Hs. 18490 hypothetical protein FU 20452 g 

105597 AF054284 Hs.334826 splicing factor 3b, subunit 1, 155kD 
105608 A1808201 Hs.287B63 hypothetical protein FU 12475 
105610 AA280072 Hs.99872 fetal Alzheimer antigen 

105617 AK0O0892 Hs.4069 glucocorticoid modulatory element binding protein 1 ■ 
105620 AW302245 Hs.1 81390 casein kinase 1, gamma 2 . 
105658 AA985190 Hs.246875 hypothetical protein FU20059 ■ 
<05 c 97 AW499988 Hs 27801 zinc finger protein 278 ' 
105708 R26944 Hs.180777 Homo sapiens mRNA; cDNA DKFZp564M0264 (from clone DKFZp564W0264) U 
105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short *.o 
105746 AW151952 Hs.46679 hypothetical protein FU 20739 ■ 
105759 AI123118 Hs.15159 ch emok'ine- like factor, a ItematK/ely spliced ]■* 
105771 AI267720 Hs.153221 synovial sarcoma, translocated to X chromosome 

105820 AA741336 Hs. 152108 transcriptional unit N 143 ' 
105826 AA478756 Hs.194477 E3 ubiquitin figase SMURF2 . 
105856 A1262106 Hs.12653 ESTs 
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105858 AF151066 Hs.281428 hypothetical protein 2.9 

105875 AK001708 Hs.32271 hypothetical protein FL J 10846 1-4 

105930 AF016371 Hs.9880 peptidyl prolyl isomerase H (cydophilin H) 5.2 

106000 AW194426 Hs.20726 ESTs 17 

106011 AW081 202 Hs. 12284 Homo sapiens, clone IMAGE:2989556, mRNA, partial cds 2.8 

106017 AA477956 Hs.26268 ESTs 14 

106073 AL157441 Hs.17834 downstream neighbor of SON 1-4 
106078 AA130158 Hs.19977 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.6 

106094 AA533491 Hs,23317 hypothetical protein FL J 14681 6 8 

106140 AB006624 Hs.14912 K1AA0286 protein 16 
106271 AA251393 Hs.289052 Homo sapiens, Similar to RIKEN cDNA 5430429M05 gene, clone MGC: 13155, mRNA, complete cds 10.8 

106288 AB037742 Hs,24336 KIAA1321 protein 1-3 

106300 Y10043 Hs,19114 high-mobility group (nonhistone chromosomal) protein 4 3.6 

106333 AL043114 Hs,22410 ESTs, Weakly similar to A54849 collagen alpha 1(VI() chain precursor [H.sapiens] 5 4 

106350 AK001404 Hs.194698 cyclin B2 5 7 

106359 AW3902B2 Hs 31 130 transmembrane 7 superf ami ly member 2 6.3 

106381 AB040916 Hs.24106 KIAA1 483 protein 6.5 

106389 AW748420 Hs.6236 Homo sapiens cDNA: FLJ21487 fis, clone COL05419 2 2 

106457 AF119256 Hs 27801 zinc finger protein 278 2 7 

106470 D63078 Hs 186 180 Homo sapiens cDNA: FLJ23038 fis, clone LNG02039 2 3 

106531 AA454036 Hs8832 ESTs 16 

106586 AA243837 Hs.57787 ESTs 16 

106589 AK000933 Hs 28661 Homo sapiens cDNA FLJ10071 fis, cbne HEMBA1O017O2 2 4 

106610 AA458882 Hs 79732 fibulin 1 7 9 

106624 NM_003595Hs 26350 tyrosylprotein sulfotransferase 2 77 

106650 AL049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from clone DKFZp564O0122) 18 

106669 AV657117 Hs 184164 ESTs, Moderately similar to S65657 alpha-1Cadrenergic receptor splice form 2 [H.sapiens] 13 

106713 BE614802 Hs 184352 hypothetical protein FLJ 12549 4 5 

106717 AA600357 Hs 239489 TIA1 cytotoxic granule-associated RNA-bin ding protein 13 

106723 BE388094 Hs 21857 ESTs 16 

106795 AF174487 Hs.293753 Bcl-2-related ovarian killer protein-like 5 7 

106829 AW959893 Hs 27099 hypothetical protein FLJ23293 similar to ARL-6 interacting protein-2 16.2 

106831 BE564871 Hs 29463 centrin, EF-hand protein, 3 (CDC31 yeast homobg) 15 

106846 AB037744 Hs.34892 KIAA1323 protein 2 2 

106852 AF151031 Hs.300631 hypothetical protein 13 

106873 N49809 Hs.11197 Homo sapiens, cbne ]MAGE:33431 49, mRNA, partial cds 16-8 

106886 W79171 Hs9567 a002 protein 15 

106906 AA861271 Hs.222024 transcription factor BMAL2 2 2 

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase 3 3 

106945 AK000511 Hs6294 hypothetical protein DKFZp434L1 435 similar to valyl tRNA synthetase 68 

106973 BE156256 Hs.11923 hypothetical protein 6 6 

106977 AL043152 Hs 50421 K1AA0203 gene product 4 8 

106978 AW631480 Hs 8688 ESTs 6 0 
107004 AA146872 Hs 300700 hypothetical protein FLJ20727 13 
107029 AF264750 Hs 288971 myeloid/lymphoid or mixed-lineage leukemia 3 1 8 
107071 AW385224 Hs 35198 ectonucleotide pyrophosphatase/phosphodiesterase 5 (putative function) 17 
107113 AK000733 Hs 23900 GTPase activating protein 2 5 
107125 AK0OO512 Hs 69388 hypothetical protein FLJ20505 1 7 
107136 AV661958 Hs 8207 GK001 protein 4 6 
107136 AV661958 Hs 8207 GK001 protein 3 3 
107146 AK0O1455 Hs 5198 Down s ynd ro me critical region gene 2 2 0 
107151 AW378065 Hs 8687 ESTs 63 
107155 AW391927 Hs 7946 KIAA 1288 protein 33.5 
107174 BE122762 Hs 25338 ESTs 55 
107197 W15477 Hs 64639 glioma pa thogenesis-related protein 6.1 
107221 AW888411 Hs 81915 leukemia-associated phosphoprotein p1 8 (stathmin) 17-4 
107243 BE219716 Hs 34727 ESTs, Moderately similar to I38759 zinc finger/leucine zipper protein [H.sapiens] 7 4 
107248 AW263124 Hs 315111 n ud ear receptor co-rep ressor/HDAC3 complex subunit 1.8 
107263 D60341 Hs 21198 translocase of outer mitochondrial membrane 70 (yeast) homo log A 6 6 
107265 BE379594 Hs 49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2 5 
107298 N95657 Hs 6820 ESTs, Moderately similar to Y0J1_CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 25 

107298 N95657 Hs 6820 ESTs, Moderately similar to YOJ1_CAEEL H 17 

107299 BE277457 Hs 30661 hypothetical protein MGC460 6 3.2 
107316 T63174 Hs.193700 Homo sapiens mRNA; cDN A DKFZp586l0324 (from clone DKFZp586l0324) 2,0 
107354 NM_006299Hs.96448 zinc finger protein 193 5.0 
107392 AW2999O0 Hs.267632 TATA element modulatory factor 1 1.2 
107481 AA307703 Hs.279766 kinesin family member 4 A 1.6 
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3 0 

107529 BE515065 Hs.296585 nucleolar protein (KKEA) repeat) 13 

Zm KsOT EST! Moderately similar to ALU7JiUWN ALU SUBFAMILY SQ SEQUENCE CONTAMINATTOM2.2 

107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed-rectffer, subfamily S. member 3 tw 

107901 L42612 Hs 335952 keratin 6B l 6 

107901 L42612 Hs.335952 keratin 6B 22 

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 6y 

108230 AA054224 Hs.59847 ESTs 71 

108274 AF129535 Hs.272027 F-box only protein 5 2 5 

108296 N31256 Hs.161623 ESTs 35 

108496 AA083069 Hs.339659 ESTs MropoOAnnoA 3 4 

108607 BE300380 Hs.69476 Homo sapiens cDNAFLJ 12758 fe, done NT2RP2001328 « 

108621 AA101809 Hs.182685 ESTs t 7 

108634 AW022410 Hs.69507 ESTs g Q 

108647 BE546947 Hs.44276 homeoboxCIO 72 

108695 ABC29000 Hs.70823 KIAA1077 protein 13 

108717 AA122393 Hs.70811 hypothetical protein FL J 205 16 2 7 

108740 AI089575 Hs.9071 progesterone membrane binding protein • 

108828 AK001693 Hs.273344 DKFZP564O0463 protein 15 

108859 AL121500 Hs.178904 ESTs 21 

108872 H06720 Hs.1 11680 endosulfine alpha 53 

108891 A1801235 Hs.48480 ESTs 40 

108894 AK001431 Hs5105 hypothetical protein FLJ 10569 ' 

10B9S AA149754 15.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds 5.6 

108982 AA151708 Hs.171980 homeo box {expressed in ES ceils) 1 6 2 

108987 AA152178 Hs.23467 hypothetical protein FLJ 10633 1 ? 

109002 AB028987 Hs.72134 KIAA1 064 protein 14 

S Si 4372127 gS2a5d07 ^1 Siralageiie oolon P37204) Ham sapiens cDNAclone 3' Mr to contains Alu mp^e 5.3 

109068 AA164293 Hs.72545 ESTs t ' 6 

109101 AW608930 Hs.52184 hypothetical protein FLJ20618 3 2 

109112 AW419196 Hs,257924 hypothetical protein FLJ 13782 1 7 

109124 AK000684 Hs.183887 hypothetical protein FLJ221 04 2 6 

109139 AJ132592 Hs.59757 zinc finger protein 281 2 g 

109166 AA219691 Hs 73625 RAB6 interacting, kinesin-like (rabkres in 6) . 

109198 BE566742 Hs 58169 highly expressed in cancer, rich in leucine heptad repeats ■ 

109213 NM„016603Hs 82035 potential nuclear protein C50RF5; GAP-like protein ^ 

109220 AW958181 Hs 189998 ESTs 5 3 

109233 AU077281 Hs 170285 nucleoporin 214kD (CAIN) „ n!on ci 14 

09270 N99673 Hs.3585 ESTs, Weekly similar to AF 126743 1 ^ 

109273 AA375752 Hs.82719 Homo sapiens mRNA; cONA DKFZp586F1822 (from clone DKFZ P 586F1822) 2.9 

109313 AF153201 Hs.86276 C2H2 (Kruppel-type) zinc finger protein 2 g 

109341 AA213506 Hs.115099 EST 15 

109391 AL096858 Hs.184245 K1AA0929 protein Msx2 interacting nuclear target (MINT] homoog ^ 

109420 H83603 Hs.40408 homeo box C9 3 0 

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subumt 6 19 

109429 A1160029 Hs,61438 ESTs 18 

109445 AA232103 Hs, 189915 ESTs 37 

109450 AB032969 Hs 173042 KIAA1 143 protein 3 2 

109468 NM_015310Hs6763 KIAA0942 protein 2Q 

109478 AW074143 Hs 87134 ESTs 21 

109570 L40G27 Hs.1 18890 glycogen synthase kinase 3 alpha u 

109662 F02614 Hs.27319 ESTs 13 

109825 R71264 Hs.16798 ESTs 20 

110039 H11938 Hs.21907 histone acetyltransferase 25 

110056 AA503041 Hs.279009 matrix Gla protein / 7 
110085 AA603840 Hs.29956 K1AA0460 protein 

lira S S 2 9 ESTs, Weakly similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 1.7 

110242 N41744 Hs.19978 CGI-30 protein , . ' 

1 10259 H28428 Hs.32406 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] u. 
110312 BE256986 Hs.11896 hypothetical protein RJ12089 
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110330 
110501 
110504 
110525 
110568 
110699 
110705 
110742 
110761 
110762 
110765 
110769 
110799 
110805 
110813 
110820 
110840 
110844 
110854 
110856 
110885 
110897 
110915 
110918 
110958 
110963 
110981 
110984 
111125 
111132 
111164 
111172 
111174 
111179 
111184 
111184 
111189 
111216 
111221 
111223 
111239 
111285 
111299 
111312 
111318 
111337 
111352 
111370 
111384 
111389 
111391 
111392 
111452 
111486 
111549 
111585 
111627 
111870 
111937 
111944 
111987 
112134 
112244 
112388 
112456 
112464 
112506 



AI288666 Hs.16621 
H55748 

H55915 Hs 210859 

H57330 Hs 37430 

AK001160 Hs5999 

T97586 Hs 18090 

AB007902 Hs 32168 

AW190338 Hs.28029 

AL1 38077 Hs 16157 

BE044245 Hs 30011 

AK000322 Hs 18457 

BE000831 Hs 23837 

AI089660 Hs.323401 

T25829 Hs.24048 

AA767373 Hs 35669 



R33261 
N31598 
AI740792 



Hs.6614 
Hs 12727 
Hs 167531 



BE612992 Hs 27931 
AA992380 

BE384447 Hs 16034 
AL1 17430 Hs.6880 
BE092285 Hs.29724 
H04360 Hs.24283 
NM_005864Hs 24587 
AK002180 Hs. 11449 
AK001980 Hs.24284 
AW613287 Hs.80120 
N63823 Hs 269115 
AB037807 Hs.83293 



N46180 
R67419 



Hs 122489 
Hs 21851 



AL050166 Hs 26295 
AK000136 Hs 10760 



AJ815486 
AI815486 
N67603 



Hs 243901 
Hs 243901 
Hs 272130 



AW139408 Hs 152940 
AB037782 Hs 15119 
AA852773 Hs 334838 
N90956 Hs 17230 
AA778711 Hs4310 
AB033O91 Hs 74313 



AI523913 
T99755 



Hs.34504 
Hs 334728 



AA837396 Hs 263925 
H58589 Hs 35156 



AI478658 
N94606 



Hs 94631 
Hs 288969 



AK000987 Hs 169111 
NM„003896Hs 225939 
W46342 Hs 325081 



R02354 

AI051194 

W90638 

R10720 

R52656 



Hs 15999 
Hs.227978 
Hs 20321 
Hs20670 
Hs 21691 



AB037834 Hs 18685 
BE298665 Hs 14846 
AW083791 Hs 21263 
NM„015310Hs6763 
R41823 Hs 7413 
AB029000 Hs 70823 
R46071 Hs 301693 
NM_016248Hs 232076 
AW007287 Hs 28538 
AI742756 Hs,26079 



DKFZP4341 116 protein 6.2 

gb:yq94a01.s1 Soares fetal liver spleen 1IMFLS Homo sapiens cDNA clone 1MAGE:203400 3* 6 1 

hypothetical protein FLJ11016 61 

EST 6 3 

hypothetical protein FLJ10298 1 3 

ESTs 1 8 

KIAA0442 protein 1.6 

hypothetical prater MGC11256 7.6 

hypothetical protein FLJ12707 2.5 

hypothetical protein MGC2963 9 3 

hypothetical protein FU2031 5 5 5 

Homo sapiens cDNA FU11812 fis, clone HEMBA1006364 2. 1 

dpy-30-like protein 1.5 

FK506 binding protein precursor 6 6 
ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 5 7 

ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.saptens] 3 4 

hypothetical protein FLJ21610 1 7 

methylcrotDnoyL Coenzyme A carboxylase 2 (beta) 1 7 

hypothetical protein FLJ 10607 similar to glucosamine -phosphate N-acety (transferase 47 

gb:ot37g06 s1 SoaresJestis_NHT Homo sapiens cDNA clone 3' similarto contains element 2 3 

hypothetical protein MGC13186 3 5 

DKFZP434D156 protein 2 2 

hypothetcal protein FU13187 2 6 

ESTs, Moderately similar to reduced expression in cancer [H. sapiens] 1 9 

signal transduction protein (SH3 containing) 6 7 

DKFZP56401 23 protein 2 0 

ADP-ribosyttransferase (NAD+; poly(ADP-ribose) polymerase)- tike 2 1 3 
UDP-N-acetyl-aipha-D-galactosamine:polypeptide N-acetylgalactosaminyitransferase 1 (Ga!NAc-T1) 1,8 

ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 3 6 

hypothetical protein 2.1 

Homo sapiens cDNA FLJ13289 fis, clone OVARC1001170 2 3 

Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 3 7 

Homo sapiens mRNA; cDNA DKFZp586D1 122 (from done DKFZp586D1 122) 7 5 

aspotin (LRR dass 1) 7.1 

Homo sapiens cDNA FLJ20738 fis, clone HEP08257 6 7 

Homo sapiens cDNA FLJ20738 fis, clone HE 3 3 

ESTs, Weakly similar to S65824 reverse transcriptase homolog [H .sapiens] 3 6 

ESTs 1 5 

KIAA1361 prote:n 2 6 

KIAA1866 protein 4 6 

hypothetical protein FLJ22087 7.9 

eukaryotic translation initiation factor 1A 6 9 

K1AA1265 protein 5 0 

ESTs 3 8 

ESTs 1 2 

LIS1 -interacting protein NUDE1, rat homolog 5 1 

Homo sapiens cONA FLJ 1 1027 fis, clone PLACE10041 14 2 2 

brefeldin A-inhibrted guanine nucleotide-exchange protein 1 2 8 

HSCARG protein 2 2 

oxidation resistance 1 2 1 
sfalyrtransferase 9 (CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase; GM3 synthase) 5 1 

Homo sapiens, clone IMAGE :3659680, mRNA, partial cds 8 4 

ESTs 27 

EST 6 5 

ESTs, Moderately similar to ZRF1 HUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE 1 4 

EST 16 

ESTs 1 6 

Homo sapiens mRNA for KIAA1413 protein, partial cds 2 4 

Homo sapiens mRNA; cDNA DKFZp564D016 (from clone DKFZp564D016) 10.6 

suppressor of potassium transport defect 3 6.6 

KIAA0942 protein 5.1 

ESTs; calsyntenin-2 2.8 

KIAA1077 protein 14.6 

Homo sapiens, done IMAGE :3638994, mRNA, partial cds 9.0 

A kinase (PRKA) anchor protein 11 14 

Homo sapiens cDNA: FLJ21086 fis, done CAS03272 1.4 

ESTs 3.2 
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112513 
112752 
112884 
112923 
112936 
112958 
112966 
112978 
112995 
112996 
113047 
113049 
113089 
113196 
113248 
113254 
113277 
113429 
113499 
113547 
113554 
113647 
113702 
113722 
113759 
113777 
113783 
113791 
113808 
113811 
113817 
113826 
113834 
113868 
113870 
113885 
113923 
113989 
114022 
114030 
114060 
114196 
114226 
114253 
114262 
114275 
114292 
114309 
114392 
114407 
114455 
114463 
114464 
114471 
114480 
114671 
114698 
114730 
114767 
114774 
114798 
114860 
114895 
114896 
114911 
114930 
114938 



R68425 
AK001635 
AK000004 
T10258 
AW970826 
R61388 
Z44718 
AK000272 
AA737033 
BE276112 
A1571940 
AW965190 
T40707 
T57317 
T63857 
AK002180 
AW971049 
AA688021 
AI467908 
H59588 
AW503990 
AA813887 
T97307 
AV653556 
AW499665 
BE266947 
AL359588 
A1269096 
W44735 
BE207480 
H13325 
AW378212 
T26483 
W57902 
AL079314 
AW959466 
AW953484 
W87544 
A3539519 
A1825386 
AB029551 
AF017445 
AB028968 
BE1 49866 
AL1 17518 
AW515443 
AI815395 
AA332453 
AA249590 
BE539976 
H37908 
AL120247 
AI091713 
AA028074 
BE066778 
AA766268 
AA476966 
A1373544 
AI859865 
AV656017 
AA159181 
AL157545 
AA236177 
BE539101 
AA236672 
AA237022 
AA242834 



Hs. 13809 
Hs.14838 
Hs.5013 
Hs.5037 
Hs.6185 
Hs.6724 
Hs,102548 
Hs.7099 
Hs7155 
Hs.7165 
Hs 7549 
Hs7560 
Hs.270862 



Hs.1 1449 
Hs.1 1774 
Hs.179808 
Hs.8882 
Hs.15233 
Hs. 142442 
Hs.1 88173 



Hs.184411 
Hs.9456 
Hs.10590 
Hs.7041 
Hs.135578 
Hs.9286 
Hs.6994 
Hs.332795 
Hs.24809 
Hs.6059 
Hs.90744 
Hs.16537 
Hs.21732 
Hs.3849 
Hs.268828 
Hs.1 20969 
Hs. 164478 
Hs.7910 
Hs.150926 
Hs.7989 
Hs.14831 
Hs.3686 
Hs.306117 
Hs.184641 
Hs.20824 
Hs.100748 
Hs.103305 
Hs.271616 
Hs.40109 
Hs.106597 
Hs.104613 
Hs.151678 
Hs.266273 
Hs.1 10857 
Hs.331328 
Hs.1 54443 
Hs.1 84325 
Hs.54900 
Hs.42179 
Hs.76591 
Hs.5324 

Hs.188717 
Hs.53384 



hypothetical protein FU10648 

hypothetical protein FLJ10773 

Homo sapiens mRNA for FLJ00004 protein, partial cds 

EST 

K1AA1557 protein 
ESTs 

glucocorticoid receptor DNA binding factor 1 
hypothetical protein FLJ20265 

ESTs, Moderately similar to 21 15357A TYKi protein [M.musculus] 

zinc finger protein 259 

ESTs 

Homo sapiens mRNA for K1AA1729 protein, partial cds 
ESTs 

qb:yb51aQ3.s1 Stratagene fetal spleen (937205) Homo sapiens cDNA clone IMAGE 74668 3', 
gb yc16e01.s1 Stratagene lung (937210) Homo sapiens cDNA done 3', mRNA sequence 
DKFZP5640123 protein 

protein (peptidyl-prolyl cis/trans isomerase) NlMA-interacting,4 (parvulm) 

ESTs 

ESTs 

ESTs 

HP1-BP74 

Horrx)sapier«cONAaJ12187fis,doneNlAJ^1000831 ]mrc1 ^ Qni 
gb:ye53h05.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone 1MAGE:121497 3 , 
albumin 



2.0 

1.8 

6.6 

1.5 

3.2 

6.0 

64 

12 

56 

20 

19 

2.4 

13 

1.7 

28 

13 

3.2 

1.2 

5.9 

2.0 

3.6 

1.3 

4.4 

1.3 



SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 



1.2 
13.4 
1.7 
1.3 
3.3 
3.1 
3.2 
2.3 
113 
2.7 
6.1 
6.6 
1.9 
1.2 
5.4 
9.4 
1.8 
1.5 



zinc finger protein 313 
hypothetical protein DKFZp762B226 
chitobiase, di-N-acetyl- 

Homo sapiens cDNA: FLJ21278 fis, clone COL01832 
Homo sapiens cDNA: FLJ22044 fis, done HEP09141 
hypothetical protein DKFZp7610l7121 
hypothetical protein FU 10826 
EGF-containing nbutin like extracellular matrix protein 2 
proteasome (prosome, macropain) 26S subunit, non-ATPase, 1 1 
hypothetical protein, similar to (U06944) PRAJA1 
ESTs 

hypothetical proteh FLJ22041 simflar to FK506 binding proteins 
ESTs 

Homo sapiens cDNA FU1 1562 fis, done HEMBA1003197 
hypothetical proteh FLJ21939 similar to 5-azacytidine induced gene 2 
RING1 and YY1 binding protein 

fucose-1 -phosphate guanylyltransf erase ^ j 

Ho^o 1 sapie^Similarto zinc finger protein 136 (clone pHZ-20), clone MGC:10647, mRNA, cornplete cds 
K1AA0978 protein 
K1AA0306 protein 
fatty acid desaturase 2 
CGI-85 protein 

ESTs, Weakly similar to A28996 proline-rich protein M14 precursor - mouse [M.musculus] 
Homo sapiens mRNA; cDNA DKFZp434B0425 (from clone DKFZp434B0425) 
ESTs, Weakly similar to ALU8.HUMAN ALU SUBFAMILY SX SEQUENCE 

H^sapienSimilar to RIKEN cDNA 1110012M11 gene, clone 1MAGE:3688605 ( mRNA, partial cds 1.2 

RP42homo)og , t . .... ' 

UDP-N-aretyl-alpha-D-galactosamine:poVpeptide N-acetylgabctosaminyltransferase 6 (GalNAc-T6) 13.4 

hypothetical protein aJ13346 \\ 
polymerase (RNA) 111 (DNA directed) polypeptide K (12.3 kDa) 
intermediate filament protein syncoiTr 

minichromosome maintenance deficient (S. cerevisiae) 4 ] ° 

CGl-76 protein ^ 

serologically defined colon cancer antigen 1 -J $ 

bromodomain and PHD finger containing, 3 ™ 

KIAA0887 protein 7 A 
hypothetical protein 

gb zt29f02 si Scares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE723771 3', mRNA sequence. 

ESTs 2.0 

ESTs zy 
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114965 
115023 
115038 
115061 
115062 
115117 
115121 
115206 
115221 
115239 
115242 
115278 
115285 
115291 
115400 
115468 
115471 
115479 
115496 
115500 
115553 
115581 
115587 
115590 
115646 
115652 
115655 
115663 
115676 
115690 
115693 
115715 
115734 
115811 
115823 
115837 
115844 
115866 
115875 
115941 
115968 
116003 
116011 
116108 
116134 
116189 
116195 
116238 
116246 
116262 
116298 
116318 
116325 
116336 
116339 
116350 
116358 
116365 
116368 
116417 
116436 
116462 
116470 
116470 
116575 
116637 
116640 



AI733881 Hs 72472 BMP-R1B 

AF102546 Hs 63931 dachshund (Drosophila) homolog 

AA252360 Hs 87968 toll-like receptor 9 

AI751438 Hs 41271 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 1913076 

AA253314 Hs 1 54 1 03 LIM protein {similar to rat protein kina 

A1670847 Hs.5324 hypothetical protein 

A1634549 Hs 88155 ESTs 

AW183695 Hs 186572 ESTs 

AW365434 Hs 79741 hypothetical protein FU101 16 

BE251328 Hs 73291 hypothetical protein FU10881 

A1368236 Hs.283732 ESTs, Moderately similar to ALU1 JiUMAN ALU SUBFAMILY J SEQUENCE 

AK002163 Hs 301724 hypothetical protein FLJ 11 301 

AW972872 Hs.293736 ESTs 

BE545072 Hs 122579 hypothetical protein FLJ10461 

AI215069 Hs.89113 ESTs 

AA314349 Hs.48499 tumor antigen SLP^8p 

AK0O1376 Hs.59346 hypothetical protein FLJ 1051 4 

AW301608 Hs.278188 ESTs, Moderately similar to 1 54 374 gene NF2 protein [H. sapiens] 

AW247593 Hs.71819 eukaryotic translation initiation factor 4E binding protein 1 

Y14443 Hs.88219 zinc finger protein 200 

AJ275986 Hs.71414 transcription factor (SMIF gene) 

A1540842 Hs.61082 ESTs 

BE081342 Hs.283037 HSPC039 protein 

AA399477 Hs.67B96 7-60 protein 

N361 10 Hs. 305971 solute carrier family 2 (facilitated glucose transporter), member 10 

BE093589 Hs.38178 hypothetical protein FLJ 23468 

AL048269 Hs.288544 Homo sapiens, clone MGC: 16063, mRNA, complete cds 

A1138785 Hs.40507 ESTs 

AA953006 Hs.88143 ESTs 

AA625132 Hs.44159 hypothetical protein FLJ21 615 

AF231023 Hs.55173 cadherin. EGF LAG seven-pass G-type receptor 3, flamingo (Drosophiia) homotog 

BE395161 Hs.1390 proteasome (prosome, macropain) subunit, beta type, 2 

A1950339 Hs.40782 ESTs 

NMJ)15434Hs.48604 DKFZP434B168 protein 

A1732742 Hs.87440 ESTs 

A1675217 Hs.42761 ESTs 

A1373062 Hs.332938 hypothetical protein MGC5370 

AW062629 Hs,52081 K1AA0867 protein 

N55669 Hs.333823 mitochondrial ribosomal protein L1 3 

AI867451 Hs 46679 hypothetical protein FU20739 

AB037753 Hs,62767 K1AA1332 protein 

BE275469 Hs 66493 Down syndrome critical region gene 5 

AL359053 Hs,57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735 

AA770688 Hs, 28777 H2A histone family, member L 

BE243834 Hs 50441 CGI-04 protein 

N35719 Hs 44749 ESTs, Moderately similar to T00358 hypothetical protein KIAA0684 [H.sapiens] 

AW821113 Hs 72402 ESTs 

AV660717 Hs 47144 DKFZP586N081 9 protein 

AF265555 Hs 250646 baculoviial lAP repeat-containing 6 

A1936442 Hs 59838 hypothetical protein FLJ10808 

A1955411 Hs 94109 Homo sapiens cDNA FLJ13634 fis, clone PUCE1011133 

AF097645 Hs 58570 deleted In cancer 1; RNA helicase HDB/DICE1 

A1472106 Hs 49303 Homo sapiens cDNA FLJ11663 fis, clone HEMBA10CK631 

AL133033 Hs.4084 KIAA1 025 protein 

AK000290 Hs 44033 dipeptidy I peptidase 8 

AA497129 Hs 184771 nuclear factor l/C (CCAAT-binding transcription factor) 

Hs 381 25 interferon-induced protein 75, 52kD 

Hs 46765 ESTs 

Hs 71109 K1AA1229 protein 

AW499664 Hs 12484 Human clone 23826 mRNA sequence 

AA161411 Hs 58668 chromosome 21 open reading frame 57 

AF218313 Hs 236828 putative helicase RUVBL 

A12721 4 1 Hs 83484 SRY (sex determining region Y)-box 4 

A1272141 Hs 33484 SRY (sex determining region Y)-box 4 

AA3 12572 Hs 6241 phosphoinositide-3-kinase, regulatory subunit, polypeptide 1 (p85 alpha) 

AK0O1O43 Hs 92033 integrin-linked kinase-associated serine/threonine phosphatase 2C 

X89984 Hs.211563 B-cell CLLTlymphoma 7A 
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3.4 
2.9 
8,3 
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117247 N21032 

117276 N71183 

117284 AK001701 

117367 AI041793 

117368 AI878942 
117382 AF150275 
117412 N32536 
117557 AF1 23050 
117588 N34895 
117745 BE294925 



Hs.90336 
Hs.40173 
Hs,42645 
Hs.44532 
Hs 44648 
Hs 46680 



117933 Y10518 
117983 AL1 10246 
118078 N54321 
118301 AA453902 
118429 AA243332 Hs.74649 
118472 AL157545 Hs.42179 
118488 AJ277275 
11&509 N22617 
118528 AI949952 
118656 AI458020 



Hs.50102 
Hs.43228 
Hs.49397 



116700 A1BO0202 Hs.317559 hypothetical protein MGC 10765 
116705 AW074819 Hs.12313 hypothetical protein FLJ 14556 

116732 AW152225 Hs.165909 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 
116921 AW068115 Hs.821 bigtycan 
116926 H73608 Hs.290830 ESTs 
117034 U72209 Hs.180324 YY1 -associated factor 2 

117132 AI393666 Hs.42315 ptO-binding protein ^ ^ nklA 

gb:yx46f06.s1 Soares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE :264803 3 , mRNA sequence. 
Hs 121806 Homo sapiens cDNA FLJ11971 fts, done HEMB81001208 4 c 

Hs.183779 Homo sapiens cDNA FLJ 10590 fis, cone NT2RP2004392, weakly similar to MNN4 PROTEIN 
Hs.42502 ESTs 

ATPase, H+ transporting, lysosomal (vacuolar proton pump), member J 
ESTs 

solute carrier family 16 (monocarboxylic acid transporters), member 6 
diubiquitin 
ESTs 

CGI-1 2 protein 
117754 AA121673 Hs 59757 zinc finger protein 281 
117879 N54706 Hs. 303025 chromosome 11 open reading frame 24 
117881 AF161470 Hs 260622 butyrate-induced transcript 1 
117904 BE540675 Hs.332938 hypothetical protein MGC5370 
117911 AL137379 Hs.47125 hypothetical protein FLJ 139 12 
Hs.1 16470 hypothetical protein FLJ20048 
Hs.47367 KIAA1785 protein 
Hs.47790 EST 
Hs.293264 ESTs 

cytochrome c oxidase subunit Vic 
bromodomain and PHD finger containing, 3 
rapa-2 (rapa gene) 

Homo sapiens cDNA FLJ11835 fis, done HEMBA1005595 
ESTs 

Hs.293287 ESTs . t 

118670 AA332845 Hs.152618 ESTs, Moderately similar to ZN91 .HUMAN ZINC FINGER PROTEIN 91 [H.sapiens] 

118698 AB033113 Hs,50187 KIAA1 287 protein 

118737 AA199686 gb zq75g09.ri Stratagene hNT neuron (937233) Homo sapiens cDNA cone IMAGE:b4/4oo b 

113925 N92293 Hs 206832 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1,4 
118984 A1668709 Hs 240722 ESTs, Moderately similar to ALU8.HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3 6 
118986 AF148713 Hs 125830 bladder cancer overexpressed protein 
119206 W24781 Hs .293798 KIAA17 10 protein 
119235 AW453069 Hs,3657 activity-dependent neuroprotective protein 
119235 AW453069 Hs.3657 activity-dependent neuroprotective p rote 
119265 BE539706 Hs.285363 ESTs 
119279 N57568 Hs,48028 EST 
119298 NM_001241Hs, 155478 cycHn T2 

1 19338 AI417240 Hs.320836 ESTs, Weakly similar to A47582 B^cel! growth factor precursor [H.sapiens] 
119349 T65004 Hs.163561 ESTs 
119403 AL117554 Hs, 1 1 9908 nucleolar protein NO P5/NOP5 8 
119478 AI624342 Hs.170042 ESTs 
119486 A1796730 Hs.55513 ESTs 

1 19513 W37933 Empirically selected from AFFX single probeset 

119601 AK000155 Hs,91684 Homo sapiens mRNA; cDNA DKFZp667l103 [from clone DKFZp667l103) 

119602 AW675298 Hs.233694 hypothetical protein FLJ1 1350 
119676 AA243837 Hs, 57787 ESTs 
119682 W61019 Hs 57811 ESTs 
119774 AB032977 Hs6298 K1AA1 151 protein 
119780 NM_016625Hs 191381 hypothetical protein 

119789 BE393948 Hs 50915 kallikrein 5 (KLK5; KLK-L2; stratum comeum tryptic enzyme) 

119805 AJ223810 Hs 43213 ESTs, Weakly similar to IEFS_HUMAN TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 
119818 AA130970 Hs 58382 hypothetical protein HJ1 1101 
119863 AA081218 Hs 58606 Homo sapiens cDNA FLJ 14206 fis, clone NT2RP3003 157 
119905 AW449064 Hs 119571 collagen, type 111, alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant) 
119966 AA703129 Hs.58963 ESTs 

120132 W 57554 Hs 125019 lymphoid nuclear protein (LAF-4) mRNA 
120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 
120248 AI924294 Hs.173259 uncharacterized bone marrow protein BM033 
120253 AA131376 Hs 326401 fibroblast growth factor 12B 
120269 AW131940 Hs.104030 ESTs 
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120274 AA1 77051 gb:nc02a02.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone IMAGE:194 similar to contains Alu 4.6 

120280 AA190577 gb:zp52g02.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 3\ mRNA sequence 2.0 

120296 AW995911 Hs.299883 hypothetical protein FLJ23399 1 8 

120297 AA191384 Hs 104072 ESTs, Weakly similar to Z195JHUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 15.2 
5 120324 AA195517 Hs 191643 ESTs 5 5 

120325 AA195651 Hs 104106 ESTs 64 

120327 AK000292 Hs 278732 hypothetical protein FL J 20285 1 6.1 

120336 N 85785 Hs 181165 eukaryotic translation elongation factor 1 alpha 1 2 9 

120342 AW450669 Hs.45068 hypothetical protein DKFZp434 1 143 5 7 

10 120345 AA210722 Hs. 104158 ESTs 4 5 

120349 AW969481 Hs 55189 hypothetical protein 16 8 

120352 R06859 Hs 193172 ESTs, Weakly similar to I38022 hypothetical protein [H .sapiens] 5 0 

120356 AF000545 Hs 296433 putative purinergic receptor 28.1 

120371 AA219305 Hs 104196 EST 124 

15 120382 AA228026 Hs.38774 ESTs 4 0 

120383 AL109963 Hs 123122 FSH primary response (LRPR1, rat) homolog 1 9 7 

1203B6 AW969665 Hs.154848 hypothetical protein DKFZp434 D0 127 32.6 

120388 AA232874 Hs 104245 ESTs 31 

120389 AW967985 Hs325572 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21 7 
20 120396 AA134006 Hs 79306 eukaryotic translation initiation factor 4E 12 5 

120404 AB023230 Hs 96427 K1AA1 013 protein 7 2 

120418 AW966893 Hs 26613 Homo sapiens mRNA; cDN A DKFZp586F 1323 (from done DKFZp586F 1323) 11 4 

120423 AA236453 Hs 18978 Homo sapiens cDN A: FLJ22822 fis, clone KAIA3968 1 9 

120472 A1950087 gb:wq05c02.x1 NCLCGAP_Kid12 Homo sapiens cDNA clone 3', mRNA sequence 19 4 

25 120473 AA251973 Hs 269988 ESTs 5 4 

120484 AA253170 Hs 96473 EST 10 4 

120504 AA256837 gb:zr84d10.s1 Soares„NhHMPu_S1 Homo sapiens cDNA clone IMAGE:682387 3', mRNA sequence. 3.9 

120509 BE047718 Hs 96545 ESTs 9- 4 

120520 AA258601 Hs 161731 EST 24 

30 120535 BE350244 Hs 96547 ESTs 2 5 

120551 AA279160 Hs 111407 Homo sapiens, clone IMAGE:3613029 ( mRNA, partial cds 5.2 

120570 AA280679 Hs 271445 ESTs, Weakly similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4 

120582 BE244830 Hs 284228 ZNF135 like protein 10 2 

120590 AW372799 Hs 125790 leucine-rich repeat-containing 2 21 

35 120596 AA282074 Hs 237323 N-acetylglucosamine-phosphate mutase 7 5 

120619 AW965339 Hs 111471 ESTs 25 

120624 AW407987 Hs 173518 M-phase phosphoprotein homolog 52.0 

120639 AA286942 gb:zs56105.s1 NCI_CGAP_GCB1 Homo sapiens cDNA done t MAGE :70 1505 3' similar to contains A!u2.4 

120648 AA287095 Hs. 140309 Homo sapiens, clone IMAGE:3677194, mRNA, partial cds 5 0 

40 120653 AW063659 Hs 191649 ESTs 2 2 

120668 AW969638 Hs 112318 6.2 kd protein 2 2 

120669 BE536739 Hs 109909 ESTs 19 

120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens cDNA clone 3' similar to contains PTR7.t1 PTR7 46.8 

120696 AI821539 Hs 97249 ESTs 2.5 
45 120713 AW449855 Hs 96557 Homo sapiens cDNA FL J 12727 fis, clone NT2RP2000027 5 9 

120718 AA292747 Hs 97296 ESTs 2 9 

120750 A1191410 Hs 96693 ESTs, Moderately similar to 21 09260AB cell growth factor [H. sapiens] 7 0 

120774 AI608909 Hs 193985 ESTs 7 8 

120807 AA346385 Hs 30002 SH3-containing protein SH3GLB2; K1AA1848 protein 6 8 

50 120809 AA346495 gb:EST52657 Fetal heart ll Homo sapiens cDNA 3' end similar to EST containing O family repeat, 4 4 

120938 AA386260 Hs 104632 EST 4 4 

120977 AA398155 Hs 97600 ESTs 4 4 

120984 BE262951 Hs 99052 ESTs 5 6 

120985 A1219896 Hs 97592 ESTs 12 
55 121011 AA398360 Hs 97608 EST 31 

121026 AI439713 Hs 165295 ESTs 3 5 

121081 AA398721 Hs 186749 ESTs, Highly si mi la r to 1 37 550 mismatch re pair protein MSH 2 [H. sapiens] 5 4 

121133 AA363307 Hs 97032 ESTs 3 7 

121176 AL1 21523 Hs 97774 ESTs 17 

60 121223 AI0021 10 Hs 97169 ESTs, Weakly similar to dJ667H1 2.2.1 [H.sapiens] 29 

121320 AA403008 Hs 301927 c6.1A 19 

121340 AW956981 Hs 97910 Homo sapiens cDNA FLJ 13383 fis, clone PLACE 1001024 3 5 

121408 AA406137 Hs 98019 EST 6 0 

121439 AA410190 Hs 98076 ESTs, Weakly similar to A 47582 B-cell growth factor precursor [H.sapiens] 7.4 

65 121450 AA406430 Hs.105362 Homo sapiens, clone MGC: 18257, mRNA, complete cds 6.9 

121452 AW971063 Hs.292882 ESTs 1.8 

121455 H58306 Hs.15165 retinoic acid induced 14 10,5 
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3 4 

121457 W07404 Hs.144502 hypothetical protern FU22055 . 

121496 AA442224 Hs.9790Q ESTs l3 ' 

121505 AA494172 Hs.194417 ESTs 2B0 

121508 AA402515 Hs.97887 ESTs 62 

121514 MUM HS ' 18151 ° gb:zt69b02.s1 SoaresJestis_NHT Homo sapiens cDNA cone IMAGE:727563 3 1 , mRNA sequence.^2.5 

K AA412497 HS ' 98142 gb:It95g12.s1 SoaresJes*s_NHT Ho™ sapiens cDNA clone IMAGE:730150 3' simDar to contains^ 

121577 AA411970 Hs.98096 EST nK|fl ' 

121581 AA416568 gb;zu05d0.s1 SoaresJestis_NHT Homo sapiens cDNA clone 3 , mRNA sequence b.i 

121589 AD001528 Hs.89713 spermine synthase ' 

121594 AA626010 Hs.98247 ESTs * 

121622 AA416931 Hs.126065 ESTs „ Ar ><^ 7 a 
121655 AA421537 Hs.178072 Homo sapiens mRNA; cONA DKFZp434B1023 (from clone DKFZp434B1 023) 

121682 AM18160 Hs.86043 Homo sapiens cDNA FU 13558 fis, clone PLACE 1007743 ™ 

121690 AV660305 Hs.1 10286 ESTs 127 

121706 U55184 Hs.154145 hypothetical protein FLJ11585 ' 

121714 AM19225 Hs.98269 Homo sapiens cDNA FLJ 11 953 fe, clone HEMBB 1000883 ™ 

121729 A1949597 Hs.98325 ESTs * 

121731 AA421041 Hs.180744 ESTs ' 

121744 AA398784 Hs.97514 ESTs 1Q5 

121748 BE536911 Hs.234545 hypothetical protein NUF2R g 

121773 AB033022 Hs.158654 KIAA1 196 protein _ 

121775 AA421773 Hs.161008 ESTs 66 

121776 AA292579 Hs 125133 hypothetical protein FLJ22501 ° Q 
121786 AI810774 Hs 98376 ESTs 58 
121832 AW340797 Hs 98434 ESTs 3B 
121836 AA328348 Hs.218289 ESTs . , 0 
121839 AA425691 Hs.191606 ESTs, Highly similar to K1AA1048 protein [H.sapiens] ™ 
121842 AF0274O6 Hs.104865 serine/threonine kinase 23 3 
121847 AA446628 Hs.2799 cartilage linking protein 1 

121871 AW972668 Hs.293044 ESTs 50 

121882 AA426376 Hs.98459 ESTs , 7 _. /QQ ' 
121911 AA427950 gb.zw50f02.s1 Soares_toteLfetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE773499 3 12 

121915 AA428179 Hs.223405 ESTs, Moderately similar to A46010 X-linked retinopathy protein [Rsapiens] £5 

121935 AA428647 Hs.98611 EST " 4 

121983 AA298760 Hs.180191 hypothetical protein FL J 14904 ■ 

121985 AI862570 Hs.299214 Homo sapiens, clone IMAGE:2822295, mRNA, partial cds ^ ^ 

121995 AA210863 Hs.3532 nemo-like kinase ■ 

121999 AA430211 Hs.98668 EST ?o 

122009 AW292763 Hs.160822 Homo sapiens cDNA: FU20863 fis, clone ADKA01804 ^ 

mm W2W Hs 9 27i°4 ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 13.1 

122050 AI453076 Hs.166109 ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 9- 1 

122060 AA431738 Hs.98750 EST ' 
122114 AW161023 Hs.104921 ESTs , , nt 

122188 AA398838 gb:zt80d01 .rl SoaresJestis.NHT Homo sapiens cDNA clone 5 , mRNA sequence J.J 

122204 AA435936 Hs.98842 EST ' 

122246 AA329550 Hs.29417 HCF-binding transcription factor Zhangfei ^ 

122257 AA436819 Hs.98899 ESTs • 

122302 AA441801 Hs.104947 ESTs n 

122341 AW601969 Hs.99010 hypothetical protein FLJ22263 similar to nuclear receptor-bind tng SET-domam protein 1 £U 

122356 AA443794 Hs.98390 ESTs " 

122369 AA443985 Hs.303222 ESTs ' 

122371 AA868555 Hs.178222 ESTs ' 

122372 AA446008 Hs.336677 EST '■ 
122378 AB032948 Hs.21356 hypothetical protein DKFZp762K20 15 " 
122405 AA446572 Hs.303223 EST l ' 
122412 AA446869 Hs.119316 ESTs 

122415 AA446918 Hs.99088 EST . J 

122418 AA446966 Hs.99090 ESTs, Moderately simitar to similar to KIAA0766 [H.sapiens] ' 
122440 AW505139 Hs.9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from clone DKFZp547C244) 

122446 AA447603 Hs.99123 EST V* 

122448 AA447626 Hs 99127 EST ™ 

122458 A1266159 Hs 104980 ESTs ^ ^ u#fv , r „ , , ' 

122460 AW418788 Hs 99148 ESTs, Weakly similar to S43569 R01H10.6 protein - Caenomabdits elegans [Celegans] 9.7 
122464 AA448158 Hs.99152 EST 
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122490 AA448349 Hs.238151 EST 61 

122492 AA448417 Hs.104990 ESTs 54 

122502 AA204969 Hs.234863 Homo sapiens cDN A FL J 12082 fis, clone HEMBB 1002492 1.3 

122510 AA449232 Hs.99195 ESTs 112 

122530 AW959741 Hs.40368 adaptor-related protein complex 1, sigma 2 subunit 10.1 

122547 AA779725 Hs.164589 ESTs 2.5 - 

122555 AA194055 Hs,293858 ESTs 19 

122570 AA452578 Hs.262907 ESTs 9 5 

122572 AA452501 Hs.99287 EST ^° 

122586 AK001910 Hs.99303 Homo sapiens cDNA FLJ1 1048 fis, clone PLACE 1004516 3.4 

122587 AB040893 Hs.6968 KIAA1460 protein 20 

122598 A1028173 Hs.99329 ESTs 17 

122599 AL355841 Hs.99330 hypothetical protein FLJ23588 4.4 
122602 AA411925 Hs.301960 ESTs 46 
122607 AA453518 Hs.98023 ESTs 61 - 5 
122614 AA453630 Hs.99339 EST 1 <>- 7 

122616 AA453638 Hs.161873 ESTs 107 3 

122617 AI681535 Hs.148135 serine/threonine kinase 33 1214 

122618 AA453641 gb:zx48e06.s1 Scares Jestis_NHT Homo sapiens cDNA clone 3', mRNA sequence 31.1 
122622 AA453987 Hs. 144802 ESTs 5 6 
122717 AA456859 Hs.178358 ESTs 8 5 
122762 A1376875 Hs.105119 ESTs 10 4 
122829 AW204530 Hs.99500 ESTs 81 - 8 
122834 AA461492 Hs.99545 Homo sapiens cDN A FL J 10658 fis, clone NTT2RP2006052 3.6 

122836 AA460581 Hs.290996 ESTs 4 5 

122837 AA461509 Hs.293565 ESTs, Weakly similar to putative p 150 [H. sapiens] 2 7 

122838 AA460584 Hs.334386 ESTs 75 3 
122854 AA600235 Hs.9625 NIMA (never in mitosis gene a)-related kinase 6 7 7 
122856 A1929374 Hs.75367 Src-like-adapter 5.8 
122861 AA335721 Hs.1 19394 ESTs ] 3 
122866 BE539656 Hs.283705 ESTs 4 - 1 
122868 AF005216 Hs.115541 Janus kinase 2 (a protein tyrosine kinase) 5.3 
122870 AW576312 Hs 318722 Homo sapiens cDNA: FLJ21766 fis, clone COLF7179 9-9 
122872 AW081394 Hs,97103 ESTs 53 
122879 AA769410 Hs.12B654 ESTs 13 9 
122907 AA470074 Hs, 169896 ESTs ] 1 * 5 
122916 AA4 70140 Hs 229170 EST 17 
122981 AA478951 Hs.105629 ESTs 50 
123013 AW968324 Hs, 17384 ESTs l 5A 
123016 A W338067 Hs, 323231 Homo sapiens cDNA FU 11 946 fis, clone HEMBB 1000709 2 8 
123034 AL359571 Hs.44054 ninein (GSK3B interacting protein) 87 
123072 AI3826O0 Hs.104308 ESTs, Weakly similar to KIAA1395 protein [H.saptensJ 8 8 
123082 AA485360 Hs.1Q5661 ESTs 39 
123088 A1343652 Hs.105667 ESTs 38 
123110 AA486256 Hs.193510 EST 7 4 
123114 BE304942 Hs.265848 myomegalin 2 8 

123131 T52027 Hs.271795 ESTs, Weakly similar to 138022 hypothetical prote'm [H. sapiens] 2.4 

123132 AI061582 Hs .324 179 Homo sapiens cDNA FLJ12371 fis, clone MAMMA1 002434 15-6 
123136 AW451999 Hs 194024 ESTs 51 
123149 AI734179 Hs 105676 ESTs 23 „ 8 
123152 AW601773 Hs 270259 ESTs 5 2 

123258 AA490929 Hs,105274 ESTs, Weakly similar to RMS1JHUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens] 9.3 

123315 AA496369 gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:755827 3* similar to4.1 

123369 AA504757 Hs.105738 ESTs 6 9 

123394 AA731404 Hs,105510 ESTs 3 6 

123433 AW450922 Hs,112478 ESTs 37 

123466 AA599042 Hs.112503 EST 7 4 

123470 AW303285 Hs.303632 Human DNA sequence from clone RP1 1-1 10H4 on chromosome 5 Contains a pseudogene similar to 3.5 

123471 AB021644 Hs. 197 21 9 zinc finger protein 14 (KOX 6) 5 2 
123475 BE439553 Hs. 2 5052 8 Homo sapiens, clone IMAGE :4098694, mRNA, partial cds 17 
123482 N95059 Hs.55098 ESTs 16 
123486 BE019072 Hs.334802 Homo sapiens cDNA FLJ14680 Ms, clone NT2RP2004242, weakly similar to 2.4 
123508 AW380388 Hs 155546 K1AA1080 protein; Golgi- associated, gamma-adaptin ear containing, ARF-b'mding protein 2 2.2 
1 23615 AA6O9170 gb:af12a12.s1 SoaresJestisJW Homo sapiens cDNA clone 3', mRNA sequence 7.8 
123619 AA602964 gb:no97c02.s1 NCLCGAP_Pr2 Homo sap'ens cDNA clone, mRNA sequence 2 8 

123658 AA609364 gb:zu71d09.s1 SoaresJestis_NHT Homo sapiens cDNA done IMAGE743441 3' similar to contains Alu. 1.7 

123674 AI269609 Hs. 1051 87 ktnesin protein 9 gene 5 - 7 
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123735 
123738 
123753 
123804 
123811 
123951 
123983 
124001 
124006 
124070 
124074 
124178 
124203 
124352 
124375 
124385 
124390 
124391 
124417 
124428 
124440 
124466 
124482 
124498 
124515 
124608 
124631 
124634 
124637 
124642 
124649 
124656 
124661 
124683 
124712 
124735 
124761 
124768 
124775 
124777 
124788 
124809 
124811 
124812 
124822 
124825 
124833 
124857 
124860 
124863 
124876 
124878 
124902 
124903 
124930 
124942 
124958 
124980 
125002 
125047 
125051 
125056 
125101 
125113 
125115 
125125 
125147 



NM_013241Hs.95231 FH1/FH2 domain-containing protein 
AA609891 Hs. 112777 EST 
AA609955 Hs.234961 Huntingtin interacting protein E 

AA620464 Hs.261915 EST, Weakly similar to S65557 alpha-IOadrenergic receptor splice form 2 I H - sa P !e i; s l 
AA620586 gb:ae60g05st Stratagene lung carcinoma 937218 Homo sapiens cDNA clone !MAGE:9513203 

AB012922 Hs, 173043 metastasis-associated Mike 1 
AJ272267 Hs 146178 choline dehydrogenase 
ralA binding protein 1 
ESTs 

H1V-1 rev binding protein 2 
topoisomerase-related function protein 4-2 
putative G protein-coupled receptor 
ESTs, Weakly similar to AF161356 1 HSPC093 [Ksapiens] 
ESTs 

K1AA0265 protein 

gb:aq49a10.x1 Stanley Frontal NB pool 2 Homo sapiens cONA clone similar to contains 
COBW-like protein 
NY-REN- 18 antigen 



Hs 75447 
Hs 270016 
Hs 154762 
Hs294030 
Hs 97101 
Hs ,269339 
Hs.102406 
Hs 192966 



L42542 
AI147155 
A1950314 
H05635 
BE463721 
AA372796 
AA640891 
D87454 
AI267847 

AA317338 Hs,7535 
AF155099 Hs 279780 
N34059 

H13540 Hs.82202 
AA532519 Hs.129C43 



gb:yv28h09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3' similar to contains Aw 



10,0 

5.2 

30.6 

2.1 

27 

6.2 

4,4 

7.0 

8,1 

37 

1.2 

3.1 

57 

31 

3.5 

57.1 

2,8 

7,1 



3.3 



2.9 



R10084 
N53935 
H79433 
AA669097 
N71076 



Hs.113319 

Hs.268997 
Hs.109370 
Hs.102800 



NIvL014053Hs.270594 
A1765123 Hs. 143671 
AA160474 Hs.75798 
AW96B856 Hs.278569 
N92593 Hs.313054 
AW297702 Hs.102915 
R48170 Hs.78436 
AA381661 Hs.119878 
R09166 Hs.191148 
R22952 Hs.268685 
AA374756 Hs.93560 
AW36852B Hs.100855 
R41772 Hs.100878 
R41933 Hs.140237 
R43543 Hs.100912 



AL355722 
R46068 
R47948 



Hs.106875 
Hs^88912 
Hs.188732 



AA418160 Hs.86043 
AA501669 Hs.336693 
AW975868 Hs.294100 
R63652 Hs.137190 
R65763 Hs.101477 
A1382555 Hs.1 27950 
AF135422 Hs-27059 
BE397530 Hs.288057 
H37941 Hs.101883 
AW296713 Hs.221441 
A1076343 Hs.1 73939 



R99978 

A1078645 

T40841 

T59338 

T79815 

T79956 

T81310 

A1472068 

T96595 

T97341 

AI222382 

W38150 



Hsi68892 
Hs.431 
Hs.98681 
Hs.269463 
Hs.279793 
Hs.100588 
Hs.1 00592 
Hs.286236 
Hs.302270 

Hs.240767 



ribosomal protein L17 

Human DNA sequence from clone 989H11 on chromosome 22q 13.1-13.2. Contains part at a 
kinesin heavy chain member 2 

gb:yv59d09.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 3\ mRNA sequence 
ESTs 
ESTs 

ESTs, Weakly similar to neuronal thread protein AD7c-NTP [H.sapiensl 
FLVCR protein 

Homo sapiens cDNA RJ13533 fis, clone PLACE1006371 
hypothetical protein 
sorting nexin 17 
ESTs 
ESTs 

ESTs! Weakly similar to M3K9JHUMAN MITOGEN ACTIVATED PROTEIN KINASE KINASE 
ESTs 
ESTs 

Homo sapiens mRNA for K1AA1771 protein, partial cds 
ESTs 
ESTs 

ESTs, Weakly similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE 
Homo sapiens cDNA: FLJ22726 fe, clone HSI15005 
Homo sapiens EST from clone 35214, full insert 
hypothetical protein FLJ22604 
ESTs 

Homo sapiens cDNA FLJ 13558 fis, clone PLACE 1007743 
ESTs 
ESTs 
ESTs 
EST 

bromodomain-containing 1 
GDP-mannose pyrophosphorylase A 
hypothetical protein FLJ22242 
ESTs 
ESTs 

ESTs, Weakly similar to ALUB.HUMAN !I!! ALU CLASS B WARNING ENTHY I!! [H.saplens] 
ESTs, Moderately simiar to B34087 hypothetical protein [H.sapiens] 
murine leukemia viral (bmi-1) oncogene homolog 
ESTs 

ESTs, Weakly similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
ESTs 
EST 
ESTs 

K1AA1B56 protein 

ESTs, Weakly similar to ALUF.HUMAN !!!! ALU CLASS F WARNING ENTRY !!! [H.sapiens] 
gb:ye57e05.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cONA clone IMAGE: 121 856 3 1 similar to 
Human DNA sequence from clone RP1-12G14 on chromosome 6q24.1-25.2. Contains the 5' end of the gene 
Empirically selected from AFFX single probeset 17 
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2.6 
7.9 
7.8 
3.3 
4.5 
3,2 
5.8 
9.3 
3.5 
6.1 
83 
56 
7 9 
57 
11.3 
9.0 
8.1 
4.9 
2.8 
5.1 
4.2 
14.2 
7.9 
6.6 
2.3 
2.7 
2.3 
23.9 
2.0 
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57 
32.4 
22.8 
61 
19 
45 
4.9 
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125161 W44657 Hs.U4232 EST 10-7 

125249 AA630863 Hs,131375 ESTs, Moderately similar to ALUBXUMAN )!!! ALU CU\SS B WARNING ENTRY !!! [H.sapiens] 1.3 

125255 AF098162 Hs.1 18631 timeless (Drosophfla) homolog 9-4 

125279 AW401809 Hs.4779 K1AA1 150 protein 1-5 

125280 AI123705 Hs.106932 ESTs 8.0 
125298 AW972542 Hs.289008 Homo sapiens cDNA: FLJ21814 fis, clone HEP01068 1-5 
125660 AW292171 Hs.23978 scaffold attachment factor B 5.9 
125827 NMJ03403Hs 97496 YY1 transcription factor 1-2 
125891 U29589 Hs 7138 cholinergic receptor, muscarinic 3 6.4 
126005 AW409701 Hs 1578 baculoviral lAP repeat-containing 5 (survivin) 14.3 
126202 AA157632 Hs 272630 vacuolar proton pump delta polypeptide 2.4 
126695 AA643322 Hs 172028 a disintegrin and rnetalloproteinase domain 10 9.1 
127050 AW411066 Hs 274351 CGI -89 protein 17 0 
127274 AW966158 Hs 58582 Homo sapiens cDNA FLJ12789 fis, clone NT2RP2001947 12.8 
128355 AW293012 Hs 161623 ESTs 7.3 
128493 D87466 Hs 240112 KIAA0276 protein 3.1 
128493 D87466 Hs 240112 KIAA0276 protein 1-3 
128522 BE173977 Hs 10098 putative nucleolar RNA helicase 9-4 
126527 AA504583 Hs 101047 transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47) 1.5 
128528 R39234 Hs 251699 ESTs, Weakly similar to IDN4-GGTR1 4 [H.sapiens] 2.8 
128595 U31875 Hs 272499 short-chain alcohol dehydrogenase family member 12.1 
128599 NM_015366Hs 102336 Rho GTPase activating protein 8 2.3 
128604 A I879099 Hs 102397 GIOT-3 for gonadotropin inducible transcription repressor-3 13 
128608 BE267994 Hs 102419 zinc finger protein 71 
128625 AB037841 Hs 102652 hypothetical protein ASH 1 13 
128629 AL096748 Hs 102708 DKFZP 4 34 A043 protein 32 
128639 AW582962 Hs 102897 CGI-47 protein 2.0 
128656 AA458542 Hs 10326 coatomer protein complex, subunit epsiton 14 
128656 AA458542 Hs 10326 coatomer protein complex, subunit epsito 1.3 
128658 BE397354 Hs 324830 dtptheria toxin resistance protein required for diphthamide biosynthesis (Saccharomyces)-fike 2 2 4 
128670 AA975486 Hs 103441 Homo sapiens, Similar to RIKEN cDNA 1700010L19 gene, clone MGC:16214, mRNA, complete cds 7.1 
128691 W27939 Hs 103834 hypothetical protein MGC5576 7.7 
128696 BE081143 Hs 225977 nuclear receptor coacth/ator 3 3 8 
128700 Y 15221 Hs 103982 small inducible cytokine subfamily B (Cys-X-Cys), member 11 16 
128714 T85231 Hs 179661 tubulin, beta 5 7.6 
128717 AK001564 Hs 104222 hypothetical protein FLJ10702 55 
128733 BE147740 Hs 104558 ESTs, Moderately similar to I38022 hypothetical protein [H.sapiens] 2 7 
128737 AF292100 Hs 104613 RP42homolog 2.8 
128742 AA307211 Hs 251531 proteasome (prosome, macropaln) subunit, alpha type, 4 4 4 

128746 AI470163 Hs 323342 acfin related protein 2/3 complex, subunit 4 (20 kD) 2 2 

128747 AB027249 Hs 104741 PDZ -binding kinase; T-cell originated protein kinase 2 8 
128772 BE302796 Hs 105097 thymidine kinase 1, soluble 5 3 
128781 N71 826 Hs 105465 small nuclear ribonucleoprotein polypeptide F 53.9 
128797 NM_002975Hs 105927 stem cell growth factor, lymphocyte secreted C type lectin 13.3 
128806 AW630942 Hs 106061 RD RNA-bfnding protein 2 6 
128814 AW248431 Hs 256526 nuclear prelam in A recognition factor 22 
128830 BE281170 Hs 106357 vatosin -containing protein 5 9 
128835 AK001731 Hs 106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from clone DKFZp586H0924) 1 6 
128854 BE159181 Hs 168232 hypothetical protein FLJ13855 2 2 
128854 BE159181 Hs 168232 hypothetical protein FU13855 1 9 
128868 AA419008 Hs 106730 chromosome 22 open reading frame 3 3 0 
128868 AA419008 Hs 106730 chromosome 22 open reading frame 3 2 2 
128871 AF189723 Hs 106778 ATPase, Ca++ transporting, type 2C, member 1 15 
128891 F34856 Hs 292457 Homo sapiens, clone MGC: 16362, mRNA, complete cds 13.3 
128906 R57988 Hs 10706 epithelial protein lost in neoplasm beta 4 7 
128920 AA622037 Hs 166468 programmed cell death 5 14 
128925 R67419 Hs 21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 19 
128946 Y13153 Hs 107318 kynurenine 3 -rnonooxygenase {kynurenine 3 -hydroxylase) 7 2 
128949 AA009647 Hs 8850 a disintegrin and rnetalloproteinase domain 12 (meltrin alpha) (ADAM-12) 2 4 

128958 AW139032 Hs 107376 hypothetical protein DKFZp434N035 1.3 

128959 AI580127 Hs 107381 hypothetical protein FLJ 11 200 10.9 
128965 AW150697 Hs 107418 ESTs 1.4 
128970 AI375672 Hs 165028 ESTs 1.3 
128975 BE560779 Hs.284233 NIC E-5 protein 14.0 
128979 AW271217 Hs.281434 Homo sapiens cDNA FLJ 14028 f/s, clone HEMBA 1003838 1.6 
128995 AI816224 Hs.107747 DKFZP566C243 protein 19 
129019 AI950087 gb.-wq05c02.x1 NCLCGAP_Kid12 Homo sapiens cDNA clone 3", mRNA sequence 2.9 
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3 8 

129021 AL044675 Hs.173081 K1AA05 30 protein 25 

129021 AL044675 Hs.173081 KIAA053O protein 34 

129032 RB0088 Hs.108104 ubiquitin -conjugating enzyme E2L 3 n« 7n A-uM5n^ 1 fH <ani«isl 5 0 

129076 AW296806 Hs.326234 ESTs, Highly similar to T46422 hypothefccal prote;n DKFZp434M2023 1 [H.sapiensj a.u 

5 129078 AI351010 Hs.102267 lysosomal 17 A 

129088 AA744610 Hs.194431 pallad'm 2 7 

129095 L12350 Hs.108623 thrombospondin 2 20 9 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 3Q 

129097 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) Q 
10 129099 AF146074 Hs. 108660 ATP-bind;ng cassette, sub-family C (CFTR/MRP), member 5 ^ 

129136 W93048 Hs.250723 hypothetical protein MGC2747 5 3 

129149 AA356620 Hs.108947 KIAA0050 gene product 18 

129172 AW162916 Hs.241576 hypothetical protein PR02577 2 1 

129192 AA286914 Hs.183299 ESTs 32 

15 129194 AA150797 Hs.109276 latexin protein 53 

129198 N 57532 Hs.109315 KIAA14 15 protein 80 

129207 AI934365 Hs.109439 osteogen (osteoinductive factor, mimecan) 2g 

129228 U40714 Hs.239307 tyrosyl-tRN A synthetase 3 2 

129229 AF013758 Hs.109643 potyadenylate binding protein-interacting protein l ^ 
20 129254 AA252468 Hs.1098 DKF2p434J 1813 protein ? 3 

129255 AI961727 Hs.1 09804 H1 histone family, member X g6 

129288 W26392 Hs 1 10080 ESTs, Weakly simitar to S13495 pregnancy zone protein [H.sapiensj ^ 

SS H^osapienscONAFU11311f.clonePLACE1010102 5.1 

25 SK S» IKS 3E2^ B 

129362 U30246 Hs.1 10736 solute carrier famHy 12 (sodium/potash m/chlonde transporters), member 2 6.7 

129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence u 

129370 AI686379 Hs.1 10796 SAR1 protein 2 0 

30 129372 NMJ)16039Hs.1 10803 CGI-99 protein 7 4 

129403 AF149785 Hs.111126 pituitary tumor-transforming 1 interacting protein 5Q 

129404 AI267700 Hs.317584 ESTs 2^5 
129404 AI267700 Hs.317584 ESTs 102 
129423 M204686 Hs.234149 hypothetical protein FLJ20647 8Q 

35 129449 AI096988 Hs.1 1 1554 ADP-ribosylat'on factor-like 7 32 

129453 AW974265 Hs 111632 Lsm3 protein 5 7 

129482 M188185 Hs 289043 spindlin 36 

129482 AA188185 Hs 289043 spindlin 71 

129513 AW843633 Hs 306163 hypothetical protein AL1 101 15 2 5 

40 129515 AF255303 Hs.112227 memb rane- associated nucleic aad binding protein ^ 

129527 AA769221 Hs. 270847 detta-tubulin 75 

129559 W01296 Hs.1 1360 hypothetical protein FU 14784 6 8 

129560 AA317841 Hs.7845 hypothetical protein MGC2752 2 0 
129570 AI923097 HsT1441 chromosome 1 open reading frame 8 16 

45 129575 F08282 Hs.278428 progestin induced protein 6 g 

129587 H14718 Hs.11506 Human done 23589 mRNA sequence 14 

129588 BE408300 Hs 301 862 postmeiotic segregation increased 2- 1 ike 9 ?3 
129591 N57423 Hs.179898 HSPC055 protein 9 0 
179594 AW403724 Hs. 36989 coagulation factor VII (serum prothrombin conversion accelerator; ■ 

50 S AF035537 Hs.115521 REV3 (yeast bowleg)*, cata^c M^^^SSS^) 

129628 U38945 Hs.1174 cyclin dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4) ^ 

129628 U38945 Hs.1174 cydin-dependent kinase inhibitor 2A (me 3 Q 

129629 AK000398 Hs.1 1747 hypothetical protein FU20391 3 3 
129649 AD00O092 Hs.16488 calreticulin 13 4 

55 129675 NM 015556Hs.172180 K I AA0440 protein 

129680 U03~749 gb:Human chromogranin A (CHGA) gene, promoter an ^' 

1?9689 AW748482 Hs.77873 B7homdog3 , . ru , 7 4 

129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiensj ^ 

129720 AA156214 Hs.12152 APMCF1 protein , 17 

60 129721 NM_001415Hs.211539 eukaryotic translation initiation factor 2, suountto flamm*. ;>zkj, ^ 

129726 H 15474 Hs.1 32898 fatty acid desatu rase 1 1Q 

129778 AK001676 Hs.12457 hypothetical protein FU 1081 4 54 

129779 AA394090 Hs.12460 Homo sapiens done 23870 mRNA sequence ^ 
129800 AF052112 Hs.12540 lysosomal 1 2 

65 129806 AB023148 Hs.173373 KIAA0931 protein 31 

129815 BE565817 Hs.26498 hypothetical protein FU 2 1657 ^ 
129840 NM_006590Hs. 12820 SnRNP assembly defective 1 homolog 
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129861 


AL049999 


Hs, 85963 


DKFZP564M182 protein 


2.2 


129864 


AI393237 


Hs.129914 


runt- related transcription factor 1 (acute myeloid leukemia 1; am 11 oncogene) 


17 


129869 


AI222069 


Hs.13015 


hypothetical protein similar to mouse Dnajll 


27 


129922 


AF042379 


Hs. 13386 


gamma-tubulin complex protein 2 


45 


129945 


BE514376 


Hs 165998 


PAI-1 mRNA- binding protein 


18 


129953 


AA412195 


Hs.13740 


ESTs 


25 


129972 


AW753185 


Hs. 180628 


dynamin 1-like 


18 


129983 


U09848 


Hs, 132390 


zinc finger protein 36 (KOX 18) 


13 


129989 


AB015856 


Hs .247433 


activating transcription factor 6 


40 


130010 


AA301116 


Hs.142838 


nucleolar phosphoprotein Nopp34 


16 


130081 


AA287325 


Hs.14713 


ESTs 


40 


130082 


S73265 


Hs.1473 


gastrin-releasing peptide 


18 


130097 


AL046962 


Hs14845 


forkhead box 03A 


28 


130100 


AL1 35561 


Hs.14891 


hypothetical protein FLJ21047 


23 


130111 


X53002 


Hs.149846 


integrin, beta 5 


23 


130112 


AA916785 


Hs 180610 


splicing factor proline/glutamine rich (polypyrimidine tract-binding protein -associated) 


30 


130112 


AA916785 


Hs.180610 


splicing factor proline/glutamine rich ( 


2.1 


130128 


L76937 


Hs. 150477 


Werner syndrome 


18 


130135 


AA311426 


Hs.21635 


tubulin, gamma 1 


61 


130211 


NM003358Hs.23703 


ESTs, Moderately similar to CEGT _HUMAN CERAMIDE GLUCOSYLTRANSFERASE [H.sapiens] 1 6 


130212 


D80001 


Hs.152629 


KIAA0179 protein 


1 7 


130236 


R85367 


Hs.51957 


splicing factor, arginine/serine-n'ch 2, interacting protein 


L U 


130241 


AL035588 


Hs 153203 


MyoD family inhibitor 


7 0 

j I 


130242 


X79201 


Hs 153221 


synovial sarcoma, translocated to X chromosome 


5 4 


130249 


D81983 


Hs322852 


GAS2-related on chromosome 22 


4 8 




NM 002497Hs 153704 


NIMA (never in mitosis gene a)-related kinase 2 


1.4 


130287 


AA479005 


Hs.154036 


tumor suppressing subtransferable candidate 3 


2.6 


I JUO IU 


AB011121 


Hs 154248 


amyotrophic lateral sclerosis 2 (juvenile) chromosome region, candidate 3 


6 3 




219084 


Hs 172210 


MUF1 protein 




130356 


AF127577 


Hs.155017 


nuclear receptor interacting protein 1 


2 4 


1 W157 


AJ224442 


Hs.155020 


putative methyltransferase 


34 


130359 


NM_013449Hs 277401 


bromodomain adjacent to zinc finger domain, 2A 


85 


130367 


AL135301 


HS8758 


hypothetical protein FLJ 10849 


14 


1 30372 


AI077464 


Hs5011 


RNA binding motif protein 9 


3.3 


130393 


N89487 


Hs.155291 


KIAA0005 gene product 


18 


130399 


AW374106 Hs155356 


hypothetical protein MGC2840 similar to a putative glucosyltransferase 


3.4 


130407 


BE385099 


Hs 334727 


hypothetical protein MGC3017 


23 


130409 


NM_001197Hs.155419 


BCL2-interacting killer (apoptosis-inducing) 


27 


130419 


AF037448 


Hs.155489 


NS1 -assxiated protein 1 


18 


130441 


U63630 


Hs 155637 


protein kinase, DNA-activated, catalytic polypeptide 


23 


130448 


BE513202 


Hs 15589 


PPAR binding protein 


39 


130455 


D90041 


Hs155956 


N-acetyltransferase 1 (arylamine Nicety (transferase) 


33.6 


130455 


D90041 


Hs.155956 


N-acetyltransferase 1 (arylamine N-acety 


46 


130471 


AL1 21438 


Hs.183706 


adducin 1 (alpha) 


27 


l JvtUvI 


BE245851 


Hs.180779 


H2B histone family, member B 


50 


130487 


U49844 


Hs 77613 


ataxia telangiectasia and Rad3 related 


43 


130498 


L38951 


Hs.180446 


karyopherin (importin) beta 1 


16 


130503 


BE208491 


Hs 295112 


KIAA0618 gene product 


16.1 


130511 


L32137 


Hs1584 


cartilage oligomeric matrix protein (pseudoachondroplasia, epiphyseal dysplasia 1, multiple) 


61 


130511 


L32137 


Hs1584 


cartilage oligomeric matrix protein (pse 


5.3 


130526 


AW876523 


Hs 15929 


hypothetical protein FLJ 12910 


2.1 


130542 


U64675 


Hs. 179825 


RAN binding protein 2-like 1 


78 


130544 


AA321238 


Hs 4310 


eukaryotic translation initiation factor 1A 


15 


130553 


AF062649 


Hs252587 


pituitary tumor-transforming 1 


14.4 


130556 


AI907018 


Hs 15977 


Empirically selected from AFFX single probeset 


47 


130567 


AA383092 


Hs.1608 


replication protein A3 (14kD) 


79 


130568 


AA232119 


Hs.16085 


putative G-protein coupled receptor 


33 


130574 


AF083208 


Hs.16178 


apo ptosis antagonizing transcription factor 


12 


130586 


AB007891 


Hs. 16349 


KIAA0431 protein 


56 


130598 


AL042210 


Hs 16493 


hypothetical protein DKF2p762N2316; KIAA1803 protein 


14 


130601 


M609738 


Hs 16525 


ESTs 


15 


130614 


AI354355 


Hs.16697 


down-regulator of transcription 1 , TBP -binding (negative cofactor 2) 


13 


130617 


M90516 


Hs1674 


glutamine-fructose-6-phosphate transaminase 1 


12.1 


130617 


M90516 


Hs 1674 


glutamir>e-fajctose-6-phosphate transamin 


2.4 


130618 


AA383439 


Hs 16758 


Spir-1 protein 


15.9 


130667 


BE246961 


Hs17639 


Homo sapiens ubiquitin protein ligase (UBE3B) mRNA, partial cds 


13.9 


130674 


AL048842 


Hs 194019 


attractin 


1.5 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



130693 R68537 

130712 AJ271881 

130714 AI348274 

130730 AB007920 Hs.18586 

130744 H59696 Hs.18747 

130751 AF052105 Hs.18879 

130757 AL036057 Hs. 18925 



130675 AA442233 Hs.17731 hypothetical protein FLJ 12892 
130692 AA652501 Hs.13561 hypothetical protein MGC4692 
Hs.17962 ESTs 

Hs.279762 bromodomain-containing7 

Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence 
KIAA0451 gene product 

POP7 (processing of precursor, S. cerevisiae) homdog 
chromosome 12 open reading frame 
protein x 0001 

1X768 AF258627 Hs 211562 ATP-binding cassette, sub-family A (ABC1), member 1 
130789 AK000355 Hs 8899 sirtuin (silent mating type jnlbfrnation regulation 2, S.oerevisiae. homotog) 5 
130815 AB018298 Hs 19822 S EC 24 (S. cerevisiae) related gene family, member D 
130836 J05068 Hs 2012 transcobalamin I (vitamin B12 binding protein, R binder family) 
1X841 AL157468 Hs 325825 Homo sapiens cDNA FLJ2084Bfc, clone ADKA01732 

130843 AA447492 Hs20183 ESTs, Weakly similar to AF164793 1 protein x 013 [H .sapiens] 

130844 U76248 Hs 20191 seven in absentia (Drosophila) homotog 2 
130855 AJ243706 Hs 143323 putative DNA/chromatin binding motif 
130861 NM_016578Hs.20509 HBV pX associated protein-8 

130879 NM„003416Hs.2076 zinc finger protein 7 (KOX 4, cioneHF.16) 

130880 BE514434 Hs,20830 kinesin-tike 2 

130892 AL120837 Hs.20993 high-glucose-regulated protein 8 
1X898 AB033078 Hs.186613 sp hingosine-1 -phosphate lyase 1 
130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamify A, member 2 
130919 N79110 Hs.21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein 
130944 BE382657 Hs.21486 signal transducer and activator of transcription 1, 91 kD 
130971 N39842 Hs.301444 KIAA1673 
130992 BE398091 Hs.74316 desmoplakin (DPI, DPII) 
Hs.21929 ESTs 

thyroid hormone receptor interactor 3 
CCAAT/enhancer binding protein (C/EBP), gamma 
Hs 171637 hypothetical protein MGC2628 

small inducible cytokine subfamily B (Cys-X-Cys), member 10 
small inducible cytokine subfamily B (Cy 

ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 
myosin VI 
myosin VI 
ESTs 

dolichyl-phosphate (UDP-N^acetylglucosamine) N-aceryiglucosaminephosphotransferase 1 ( 
dolichyl-phosphate (UDP-N^acetylglucosam 
131099 AL133353 Hs 226581 COX1 5 (yeast) homolog, cytxhromec oxidase assembly protein 
131174 NMJX)6540Hs 29131 nuclear receptor coactivator 2 
cyclin B1 
ESTs 

CGI-26 protein 

thyroid hormone receptor-associated protein, 95-kD subunit 
zinc finger protein 281 
131233 D89053 Hs 268012 fatty-acid-Coenzyme A ligase, long-chain 3 
131243 AW383256 Hs 24752 spectrin SH3 domain binding protein 1 
131245 AL080080 Hs 24766 thioredoxin domain-containing 
131247 AL043100 Hs.326190 fatty acid amide hydrolase 
131281 AA251716 Hs 25227 ESTs 

131283 X80038 Hs.339713 Homo sapiens clone F1 9374 APOE-C2 gene cluster 

131X5 AV656017 Hs.184325 CGI-76 protein 

131320 AA505691 Hs.145696 splicing factor (CC 1.3) 

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 

131375 AW293165 Hs.143134 ESTs 

131390 BE269388 Hs.182698 mitochondrial ribosomal protein L20 
^410 BE259110 Hs.279836 HSPC 166 protein 

131412 NM_012247Hs.124027 SELENOPHOSPHATE SYNTHETASE ; Human selenium donor protein 
131429 AL046302 Hs.26750 hypothetical protein FLJ 2 1908 
hypothetical protein FLJ20392 
KIAA1458 protein 
GK001 protein 
GK001 protein 

Homo sapiens cDNA: FU21333 fts, clone COL02535 
UDP-glucose dehydrogenase 



130993 T97401 

131005 AV658308 Hs.2210 

131028 AI879165 Hs 2227 

131042 AI826288 

131046 AA321649 Hs.2248 

131046 AA321649 Hs2248 

131047 H23230 Hs 22481 
131060 AA194422 Hs 22564 
131060 AA194422 Hs 22564 
131070 N53344 Hs 22607 
131076 AA749230 Hs 26433 
131076 AA749230 Hs 26433 



131185 BE280074 Hs 23960 
131206 AW1 38839 Hs 24210 
131213 AA885699 Hs 24332 
131225 H62087 Hs 31659 
131231 N47468 Hs 59757 



131458 BE297567 Hs27047 

131475 AA992841 Hs.27263 

131501 AV661958 Hs.8207 

131501 AV661956 Hs.8207 

131511 AA732153 Hs.27865 

131528 AU076408 Hs.28309 
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131532 
131543 
131544 
131562 
131564 
131564 
131569 
131618 
131622 
131623 
131623 
131643 
131653 
131656 
131669 
131692 
131714 
131722 
131737 
131760 
131760 
131763 
131772 
131774 
131787 
131793 
131795 
131798 
131817 
131824 
131850 
131878 
131885 
131885 
131887 
131900 
131900 
131904 
131905 
131913 
131916 
131925 
131929 
131941 
131950 
131962 
131965 
131971 
131977 
131985 
131991 
132019 
132031 
132062 
132084 
132103 
132105 
132116 
132176 
132180 
132192 
132194 
132203 
132207 
132235 
132240 
132252 



BE268278 Hs.28393 
AW966881 Hs.41639 
AL355715 Hs.28555 
NM_003512Hs 28777 
T93500 Hs 28792 



T93500 
AL389951 



Hs.28792 
Hs 271623 



BE393822 Hs 29645 

R78195 Hs.29692 

AB037791 Hs 29716 

AB037791 Hs 29716 

AW4106O1 Hs 30026 

AW960597 Hs 30164 
AI218918 



X52486 



Hs 30209 
Hs 3041 



BE559681 Hs.30736 

AA642831 Hs 31016 

D13757 Hs 311 

AK001641 Hs.31323 



X76732 
X76732 
AI878932 



Hs.3164 
Hs 3164 
Hs.317 



AA382590 Hs 170980 
BE267158 Hs 169474 
D87077 Hs 196275 
AW966127 Hs 32246 
BE501849 Hs.32317 
X86098 Hs 301 449 



Hs.3280 

Hs.32935 

Hs.33184 



U20536 
U28838 
A1251317 
AA083764 Hs 6101 
BE502341 Hs.3402 
BE502341 Hs3402 
W17064 
AA099014 Hs.231029 
AA099014 Hs.231029 
AF078866 Hs 284296 
AA179298 Hs3439 
AW207440 Hs.185973 
AA025976 Hs 34569 
AF151048 Hs 183180 
BE541211 Hs 34804 
BE252983 Hs 35086 
AA355113 Hs 35380 
AK000046 Hs.267448 
W79283 Hs.35962 
BE567100 Hs,154938 
U90441 Hs.3622 
AA503020 Hs 36563 
AF053306 Hs.36708 
H56995 Hs.37372 
AF1 93844 Hs.3758 
BE266155 Hs.3832 
NM.C02267Hs.3886 
BE171921 Hs.3991 
AV646076 Hs.39959 
AW960474 Hs.40289 
AA857025 Hs.8878 
NM_004460Hs.418 
AA206153 Hs.4209 
R42432 Hs.4212 
NMJJ04782HS.194714 
BE206939 Hs.42287 
AV658411 Hs.42656 
AB018324 Hs.42676 
AI566004 Hs.141269 



hypothetical protein MGC2592 

programmed cell death 2 

programmed cell death 9 (PDCD9) 

H2A histone family, member L 

Homo sapiens cONA FU11041 fis, clone PLACE1004405 

Homo sapiens cDNA FLJ11041 fis, clone PL 

nucleoporin 50kD 

Homo sapiens mRNA; cDNA DKF2p761C029 (from clone DKFZp761C029); partial cds 

Homo sapiens cDNA FLJ11436 fis, clone HEMBA1001213 

hypothetical protein FLJ10980 

hypothetical protein FL J 10980 

HSPC182 protein 

ESTs 

KIAA0854 protein 

uracil-DNA grycosybse 2 

KIAA0124 protein 

putative DNA binding protein 

phosphoribosyl pyrophosphate amidotransferase 

inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated protein 

nucleobindin 2 

nucleobindin 2 

topoisomerase (DNA) I 

K1AA0948 protein 

DKFZP586J01 19 protein 

KIAA0240 protein 

Homo sapiens cDNA FU14656 fis, clone NT2RP2002439 

high-mobility group 20B 

adenovirus 5 E1A binding protein 

caspase 6, apoptosis-related cysteine protease 

TATA box binding protein (TBP)-associated factor, RNA polymerase III, GTF3B subunrt 2 
ESTs 

hypothetical protein MGC3178 
ESTs 
ESTs 
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Hs 332848 SWl/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 3.2 



Homo sapiens, clone MGC:15961, mRNA, complete cds 
Homo sapiens, clone MGC: 15961, mRNA, com 
Homo sapiens cDNA: FLJ22993 fis, clone KAT1 1914 
stomatin-like 2 

degenerative spermatocyte (homolog Drosophila; lipid desaturase) 
ESTs 

anaphase promoting complex subunit 11 (yeast APC11 homolog) 
Homo sapiens cDNA FU11472 fis, clone HEMBA1001711 
ubiquitin specific protease 1 
x 001 protein 

hypothetical protein FU20039 
ESTs 

hypothetical protein MDS025 

procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide I 
hypothetical protein FU22418 

budding uninhibited by benzimidazoles 1 (yeast homolog), beta 

Homo sapiens DNA binding peptide mRNA, partial cds 

COP9 complex subunit 7a 

clathrin-associated protein AP47 

karyopherin alpha 3 (importin alpha 4) 

ESTs 

ESTs 

ESTs 

kinesin-like 1 

fibroblast activation protein, alpha 
mitochondrial ribosomal protein L37 
ESTs 

synaptosomal-associated protein, 29kD 
E2F transcription factor 6 
KIAA1681 protein 
KIAA0781 protein 

Homo sapiens cDNA: FU21550 fis, clone COL06258 
132 
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132256 AA301228 Hs.43299 hypothetical protein FU 12890 57 

132273 AA227710 Hs.43658 DKFZP586L151 protein * 2 

132276 AA653507 Hs.285711 hypothetical protein FLJ 13089 2A 

132288 N36110 Hs.305971 solute carrier fa miry 2 (facilitated glucose transporter), member 10 1-5 

5 132294 AB023191 Hs.44131 KIAA097 4 protein ]° Q ° 

132298 NM_015986Hs.7120 cytokine recepto Mike molecule 9 1-9 

132299 AW405882 Hs.44205 cortistatin 

132325 N37065 Hs.44856 hypothetical protein FLJ1 21 16 2.0 

132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleo protein D-like 6.5 

10 132370 AW572805 Hs.46645 ESTs z -\ 

132374 AF155582 Hs.46744 corel UDP-galactose:N-acetylgalactosamine-alpha-R beta l^galactosyrtrartsferase 1.5 

132376 AI279892 Hs.46801 sorting nexin 14 J2 5 
132384 AA312135 Hs.46967 HSPC034 protein 

132393 AL135094 Hs.47334 hypothetical protein RJ1 4495 ]-9 

15 132450 AA100012 Hs.48827 hypothetical protein FLJ 12085 ™ 
132452 AW973521 Hs.247324 mitochondrial ribosomal protein S1 4 

132456 AB011084 Hs.48924 KIAA05 12 gene product; ALE X2 J 7 

132465 AW169847 Hs.49169 KIAA1 634 protein °° 

132470 AI224456 Hs.4934 H.sapiens polyA site DNA ^ 

20 132484 X16660 Hs.1 19O07 RAB4, member RAS oncogene family * * 

132518 AW885606 Hs.5064 ESTs ° \ 

1 32528 T78736 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeastj-like 1 o o 

132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiae)-iike 1 2 0 

132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 2.9 

25 132534 BE388673 Hs.5086 hypothetical protein MGC 10433 2 2 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 J.3 

132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1, S. cerevisiae, homolog of 1' 

132574 AW631437 Hs.5184 TH1 drosophila homolog 7A 

132596 AK001484 Hs.5298 CGU5 protein l _ l 

30 132611 AA345547 Hs.53263 hypothetical protein FLJ 13287 2.2 

132612 H12751 Hs.5327 PR01914 protein b ° 

132616 BE262677 Hs.283558 hypothetical protein PR0 1855 ^ 

132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 11-4 

132648 U51127 Hs.54434 hypothetical protein MGC 171 5 1-* 

35 132668 AB018319 Hs.5460 K1AA07 76 protein 2.6 
132692 AW191962 Hs.249239 collagen, type VIII, alpha 2 

132715 F1 1875 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT2RP2005645 1 5 

132718 NM_0O4600Hs.554 Sjogren syndrome antigen A2 (60kD, ribonucleoprotein autoantigen SS-A/Ro) 3 0 

132724 AI142265 Hs.55498 geranylgeranyi diphosphate synthase 1 2 4 

40 132731 AI189075 Hs.301872 hypothetical protein MGC4840 ]2.4 
132744 AA010233 Hs.55921 glutamy I -prolyMRNA synthetase ™ 

132760 AA1 25985 Hs,56145 thymosin, beta, identified in neuroblastoma cells 2 7 

132771 Y10275 Hs.56407 phosphoserine phosphatase 3-° 
132773 AA459713 Hs.295901 KIAA0493 protein 

45 132784 AI142133 Hs.56845 GDP dissociation inhibitor 2 |° 

132798 AI026701 Hs.5716 KIAA0310 gene product 3 7 

132807 U07418 Hs 57301 mul (E. coli) homolog 1 (colon cancer, nonpotyposis type 2) 1-8 

132810 AB007944 Hs.5737 KIAA0475 gene product =9 

132813 BE3 13625 Hs.57435 solute carrier family 1 1 (proton-coupled divalent metal ion transporters), member 2 BY 

50 132815 AI815189 Hs.57475 sex comb on mid leg homolog 1 j>.4 

132817 N27852 Hs.57553 tousled -like kinase 2 3 b 

132821 AJ251595 Hs.169610 CD44 antigen (homing function and Indian blood group system) Zfl 
132833 U78525 Hs.57783 eukaryotic translation initiation factor 3, subuntt 9 (eta, 11 6kD) 14.6 
132842 NM_016154Hs.279771 Homo sapiens clone PP1 596 unknown mRNA 1j> 

55 132844 F12200 Hs.5811 chromosome 21 open reading frame 59 2 5 

132851 U09716 Hs 287912 lectin, mannose-binding, 1 |J 
132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 4 -2 
132869 AW96321 7 Hs. 203961 ESTs, Moderately similar to AF1 16721 89 PR02168 [H.sapiens] 2 8 

132873 AW007683 Hs 58598 K1AA1 266 protein 20 

60 132875 NM_004850Hs 58617 Rho-associated, coiled<:oil containing protein kinase 2 i o 

132891 BE267143 Hs 59271 U2(RNU2) small nuclear RNA auxiliary factor 1 (non-standard symbol) 1.4 
1 32897 AW503667 Hs 59545 ring finger protein 15 5 4 

132902 AI936442 Hs 59838 hypothetical protein FLJ 10808 61 
132912 AW732760 Hs 167578 Homo sapiens cDNA FLJ 1 1095 fis, clone PLACE1005374 7,1 

65 132913 W78714 Hs 60257 Homo sapiens cDNIA FLJ 13598 fis, done PLACE 1009921 2.8 

132940 T79136 Hs 127243 Homo sapiens mRNA for KIAA 1724 protein, partial cds 6.1 

132941 A1817165 Hs.6120 hypothetical protein FLJ 13222 10.3 
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132942 AA554458 Hs.1 97751 KIAA0666 protein 18 

132952 AI658580 Hs,61426 Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 2 2 

132962 AA576635 Hs.6153 CGI-48 protein 4 9 

132972 AA034365 Hs.288924 Homo sapiens cONA FLJ1 1392 fis, clone HEMBA1 000575 2.7 

5 132973 AA035446 Hs.323277 ESTs 5 3 

132977 AA093322 Hs.301404 RNA binding motif protein 3 3 2 

132980 AA040696 Hs.62016 ESTs 13 

132994 AA112748 Hs 279905 clone HQ0310 PRO031 Op 1 3 0 

133012 AA847843 Hs 62711 Homo sapiens, clone I MAG E3351 295, mRNA 10.3 

10 133015 AJ002744 Hs 246315 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 7 (GalNAc-T7) 2.1 

133016 AI439688 Hs6289 hypothetical protein FLJ20886 1 3 

1 33053 AI065016 Hs 6390 Homo sapiens clone FLB3344 PRO0845 mRNA, complete cds 6 0 

133062 AW500374 Hs 64056 PRO0 149 protein 5 3 
133069 BE247441 Hs 6430 protein with polyglutamtne repeat; calcium (ca2+) homeostasis endoplasmic reticulum protein 4 9 

15 133091 AK001628 Hs 64691 KIAA0483 protein 3 5 

133110 AA808177 Hs 65228 ESTs 13.1 

133134 AF198620 Hs 65648 RNA binding motif protein 8A 13 

133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961 368, mRNA, partial cds 2 2 

133152 Z11695 Hs 324473 m itog en-activated protein kinase 1 13 

20 133174 AA431620 Hs 324178 hypothetical protein MGC2745 17.1 

133175 AW955632 Hs 66666 ESTs, Weakty similar to S19560 proltne-rich protein MP4 - mouse [M.musculus] 18 

133177 X97795 Hs 66718 RAD54 (S.cerevisiae)-like 4 9 

133197 AI275243 Hs 180201 hypothetical protein FU20671 3 1 

133208 AI80T777 Hs.6774 ESTs 44 

25 133226 AW954569 Hs 296287 Homo sapiens, Similar to bromodorna '^containing 4, clone IMAGE;3542455, mRNA, partial cds 1 7 

133228 AI492924 Hs.6831 gc^i phosphoprotein 1 6 0 

133240 AK001489 Hs 242894 ADP-rlbosylation factor-like 1 15 

133254 AI567421 Hs 273330 Homo sapiens, clone IMAGE:3544662, mRNA, partial cds 14 

133266 AI160873 Hs.69233 zinc finger protein • 5 6 

30 133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKHEAD BOX PROTEIN D2 [H. sapiens] 19 

133285 M76477 Hs 289082 GM2 ganglioside activator protein 4 7 

133291 BE297855 Hs 69855 NRAS-relatedgene 5 0 

133314 AA102670 Hs 70725 gamma-am inobutyric acid (GABA) A receptor, pi 2 7 

133321 T79526 Hs.179516 integral type I protein 9 3 

35 133327 AL390127 Hs.7104 KruppeMike factor 13 44 

133347 BE257758 Hs 71475 acid cluster protein 33 18 

133360 AI016521 Hs 71816 v-akt murine thymoma viral oncogene homolog 1 5 5 

133366 AA292811 Hs 72050 non-meta static cells 5, protein expressed in (nucleoside -diphosphate kinase) 27 

133367 AF231919 Hs.18759 KIAA0539 gene product 17 
40 133370 AF245505 Hs 72157 DKFZP564 1 1922 protein 18 

133383 BE313555 Hs.7252 KIAA 1224 protein 17 

133390 AI950382 Hs 72660 phosphatidylserine receptor 13 

133391 AW103364 Hs.727 inhi bin, beta A (acttvin A, acth/in AB alpha polypeptide) 16.1 
133394 AA305127 Hs 237225 hypothetical protein HT023 12 2 

45 133437 AL031591 Hs.7370 phosphotidylinosito I transfer protein, beta 104 

133452 NM_002759Hs 274382 protein kinase, interferon-Inducible double stranded RNA dependent 1 2 

133453 AI659306 Hs.73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 17 
133500 AW964804 Hs.74280 hypothetical protein FLJ22237 1 1 1 
133529 W45623 Hs 74571 ADP-ribosylation factor 1 2 8 

50 133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subunit, non-ATPase, 2 2 9 

133543 AU077073 Hs 108327 damage-specific DNA binding protein 1 (127kD) 2 5 

133578 AU077050 Hs.75066 translin 1 5 

133579 X75346 Hs .75074 mitog en-activated protein kinase-activated protein kinase 2 2 1 
133582 BE391579 Hs.75087 Fas- activated serine/threonine kinase 13 

55 133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S.cerevisiaePWPI 2.2 

133595 AA393273 Hs.75133 transcription factor 6-like 1 (mitochondrial transcription fa dor Mike) 15 

133599 NMJ)02885Hs,75151 RAP 1, GTPase activating protein 1 5 7 

133621 NMJ04893HS.75258 H2A hi stone family, member Y 25,5 

133627 NMJ02047HS.75280 glycyMRNA synthetase 15 8 

60 133631 NM„000401 Hs.75334 exostoses (multiple) 2 3.3 

133649 U25849 Hs.75393 acid phosphatase 1 , soluble 1-6 

133690 AV661185 Hs.75574 mitochondrial ribosomal protein L19 4-1 

133720 L27841 Hs.75737 pericentriolar material 1 1-5 

133722 AW969976 Hs.279009 matrix Gla protein 5 3 

65 133751 AW402048. Hs.334787 Homo sapiens, Similar to likely orthotog of yeast ARV1 , clone IMAGE:3506392, mRNA 3.9 

133757 T52946 Hs.196209 RAE1 (RNA export 1, S.pombe) homolog 1.7 

133760 BE271766 Hs. 181357 iaminin receptor 1 (67kD, ribosomal protein SA) 1.8 
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133735 M62194 Hs.75929 cadhenn 11, type 2, OB-cadherin (osteoblast) 1-5 

133780 AA557660 Hs.76152 deoorin ^.5 

133784 BE622743 Hs.301064 arfaptin 1 J 

133791 M34338 Hs 76244 spermidine synthase 2.6 

5 133797 AL133921 Hs.76272 retinoblastoma-binding protein 2 ™ 

133822 D50525 Hs,699 peptidy I prolyl isomerase B (cycbphii'm B) 8 0 

133842 AW797468 Hs 285013 putative human H LA class II associated protein I 13-5 

133845 AA147026 Hs.76704 ESTs 22 

133850 W29092 Hs7678 cellular retinoic add-binding protein 1 J° 

10 133859 U86782 Hs 178761 26S proteasome^associated pad1 homolog 2 0 

133865 AB011155 Hs 170290 discs, large (Drosophila) homolog 5 2 8 

133867 AW3401 25 Hs 76989 K1M0097 gene product °7 

133868 AB012193 Hs 183874 cullin4A 2 5 
133881 U30872 Hs 77204 centromere protein F (350/400kD, mitosin) • 3 0 

15 133922 U30825 Hs 77608 spScing factor, arginine/senne-rfch 9 ' * 

133924 D86326 Hs 325948 vesicle docking protein p1 15 J 4 

133929 NM_006306Hs 211602 SMC1 (structural maintenance of chromosomes 1, yeast)-like 1 4.9 

133936 L17128 Hs.77719 gamma^lutamyl carboxylase 37 

133941 BE244332 Hs 77770 adaptor-related protein complex 3, mu 2 subunit 12_1 

20 133959 X81789 Hs 77897 splicing factor 3a, subunit 3 t 60kD 

133976 Al 908 165 Hs 169946 GATA-binding protein 3 (T-cell receptor gene activator) 31 

133989 AL040328 Hs 78202 SWI/SNF related, matrix associated , actin dependent regulator of chromatin 13 

133997 AI824113 Hs 78281 regulator of G-protein signalling 12 97 

134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor 24 

25 134015 D31764 Hs 278569 sorting nexin 17 2 5 

134070 NMJ»3590Hs 78946 cullln 3 \\ 

134110 U41060 Hs 79136 LIV-1 protein, estrogen regulated * l 

134129 NMJ14742Hs 79305 K1AA02 5 5 gene product 2 2 

134134 H86504 Hs 173328 protein phosphatase 2, regulatory subunit B (B56), epsilon isoform 5 0 

30 134200 BE559598 Hs. 197803 KIAA01 60 protein 3.2 
134206 AF107463 Hs 79968 splicing factor 30, survival of motor neuron-related 

134208 NM_000288Hs79993 peroxisomal biogenesis factor 7 2.1 

134219 NM_000402Hs 80206 g I ucose-6-phosp hate dehydrogenase »J 

134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, partial cds 2 8 

35 134275 AI878910 Hs.3688 cisp latin resistance -associated overexpressed protein 18 

134292 AI906291 Hs.81234 immunoglobulin superfamily, member 3 20 

134301 AW502505 Hs.81360 Homo sapiens cDNA: FLJ21927 fis. clone HEP04178, highly similar to HSU90909 2.5 

134305 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 2 ° 

134324 AB02S023 Hs.1 79946 K1AA1 100 protein 10 Q 4 

40 134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (versican) 

134329 N92036 Hs.81848 RAD21 (S. pombe) homolog 2.6 

134337 NM_004922Hs.81964 SEC24 (S. cerevisiae) related gene family, member C 2.3 

134348 AW291946 Hs.82065 interieukin 6 signal transducer (gp130, oncostatin M receptor) 13^0 

134367 AA339449 Hs.82285 phosphwbosylgiydn amide formyltransferase, phosphoribosylglycinamide synthetase, 8.8 

45 134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46 kD) 15 

134379 AW362124 Hs.323193 hypothetical protein MGC 3222 , 8 1 
134384 AI589941 Hs.8254 Homo sapiens, Similar to tumor differentially expressed 1 , done IMAGE:3639252, mRNA, partial cds 2.6 

134391 M417383 Hs.82582 integrin, beta-like 1 (with EGF-fike repeat domains) jl 

134395 AA456539 Hs.8262 lysosomal I 7 

50 134403 AA334551 Hs.82767 sperm specific antigen 2 l ° 

1 34405 AW067903 Hs.82772 collagen, type XI, alpha 1 J 3 

134411 BE272095 Hs.167791 reticulocalbin 1 , EF-hand calcium binding domain 3 2 

134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 1-9 

134421 AU077196 Hs.82985 collagen, type V, alpha 2 ^ 

55 134424 244190 Hs.83023 peroxisomal biogenesis factor 11B 24 

134446 AA112036 Hs.83419 KIAA025 2 protein n < , 4nc * \l 

1 34447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 (p105) 1 6 
134470 X54942 Hs.83758 CDC28 protein kinase 2 2.1 
134480 NM 005000 Hs.839 16 Empirically selected from AFFX single probeset 5 3 

60 134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 2 5 

134498 AW246273 Hs.84 131 threonyl-tRN A synthetase 21 

134513 AA425473 Hs.84429 KIAA0971 protein 3 8 

134516 AK001571 Hs.273357 hypothetical protein FLJ 10709 2 4 

134520 BE0910O5 Hs74861 activated RNA polymerase 1 1 transcription cofector 4 67 

65 134529 AW41 1479 Hs.848 FK506-binding protein 4 (59kD) 2 3 

134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor for tRNAs) 5.5 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8 
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134612 AW068223 Hs.171581 ubiquitin C- terminal hydrolase UCH37 

134624 AF035119 Hs.8700 deleted in five r cancer 1 

134532 X78520 Hs.174139 chloride channel 3 

134654 AK001741 Hs8739 hypothetical protein FLJ10879 

5 134664 AA256106 Hs 87507 ESTs 

134666 BE391929 Hs8752 transmembrane protetn 4 

134687 U62317 Hs 88251 arylsulfetaseA 

1 34692 NM_003474Hs.8650 a dtsintegrin and metalloproteinase domain 1 2 (meltrin alpha) 

134705 BE 161 887 Hs 88799 anaphase- promoting complex subu nit 10 

10 134714 Y14768 Hs890 lysosomal 

134719 AA852985 Hs 89232 chromobox homolog 5 (Drosophila HP1 alpha) 

134722 AF1 29536 Hs 284226 F-box only protein 6 

134724 AF045239 Hs 321576 ring finger protein 22 

134746 X07871 Hs 89476 CD2 antigen (p50), sheep red blood ceil receptor 

15 134751 AW630803 Hs 89497 laminBI 

134790 BE002798 Hs 287850 integral membrane protein 1 

134806 AD001528 Hs 89718 spermine synthase 

134834 AW451 370 Hs 8991 adaptor-related protein complex 1 , gamma 2 subunit 

134850 AI701162 Hs 90207 hypothetical protein MGC1 1 1 38 

20 134853 BE268326 Hs 90280 5^mirwhiWazoM<artx)xamide ribonucieotide formylU 

134859 D26488 Hs 90315 KIAA0007 protein 

134880 AI879195 Hs 90606 15 kDa selenoprotein 

134910 AA532963 Hs9100 Homo sapiens cDNA FLJ13100 fis, clone NT2RP3002255 

134925 AW885909 Hs6975 PRO1073 protein 

25 134955 AW401361 Hs 91773 protein phosphatase 2 (formerly 2A), catalytic subunit, alpha isoform 

134971 AI097346 Hs 286049 phosphoserine aminotransferase 

134975 R50333 Hs 92186 Leman co;led-coil protein 

135011 AB037835 Hs 92991 KIAA1414 protein 

1 35022 NMJ)00408Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mitochondrial) 

30 135032 AW301984 Hs.173685 hypothetical protein FLJ12619 

135077 AW503733 Hs.9414 KIAA1 488 protein 

135083 AB036063 Hs.94262 p53-inductble ribonucleotide reductase small subunit 2 homolog 

135095 AF027219 Hs.9443 zinc finger protein 202 

135096 AA081258 Hs.132390 zinc ftnger protein 36 (KOX 18) 
35 135153 AI093155 Hs.95420 JM27 protein 

135181 BE250865 Hs.279529 px 19-1 ike protein 

135199 AA477514 Hs.96247 transiirvassociated factor X 

135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATiVE C10 PROTEIN [H.sapiensj 

135214 T78802 Hs 96560 hypothetical protein FLJ 11 656 

40 135243 BE463721 Hs.97101 putative G protein-coupled receptor 

135245 AI028767 Hs.262603 ESTs 

135257 AW291023 Hs 97255 ESTs, Weakly similarto A46O10 X-linked retinopathy protein [H.sapiensj 

135263 AI088775 Hs 55498 geranytgeranyl diphosphate synthase 1 

135274 AA448450 Hs 112017 GE36gene 

45 135294 AA150320 Hs9800 protein kinase Nimu^RI 

135295 AI090838 Hs 98006 ESTs 

135307 AI743770 Hs 98368 ESTs, Weakly similar to KIAA0822 protein [H.sapiensj 

135321 AI652069 Hs 98614 ribosome binding protein 1 (dog 180kD homolog) 

135354 AA456454 Hs 183418 cell division cycle 2-like 1 (PITSLRE proteins) 

50 135361 AA373452 Hs 167700 Homo sapiens cDNA FLJ 10174 fis, done HEMBA 1003959 

135389 U05237 Hs 99872 fetal Alzheimer antigen 

135400 X78592 Hs 99915 androgen receptor (dthydrotestosterone receptor; testicular feminization; spinal and bulbar 

134975 R50333 Hs92186 Leman coited-coil protein 

135011 AB037835 Hs 92991 KIAA14 14 protein 

55 135022 NM_000408Hs 93201 glycerol-3-phosphate dehydrogenase 2 (mi 

135032 AW301984 Hs 173685 hypothetical protein FLJ 1261 9 

135077 AW503733 Hs.9414 K1AA1488 protein 

135083 AB036063 Hs 94262 p53 -inducible ribonucleotide reductase s 

135095 AF027219 Hs9443 zinc f,ng or protein 202 

60 135096 AA081258 Hs 132390 zinc finger protein 36 (KOX 18) 

135153 AI093155 Hs 95420 J M27 protein 

135181 BE250865 Hs 279529 p*19-l;ke protein 

135199 AA477514 Hs 96247 trans lin-associated factor X 

135207 N26427 Hs9634 ESTs, H:ghfy similar to C10J-IUMAN PUTATI 

65 135214 T78802 Hs .96560 hypothetical protein a J1 1656 

135243 BE463721 Hs.97101 putative G protein-coupled receptor 

135245 AI028767 Hs 262603 ESTs 
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135257 AW291023 Hs.97255 ESTs, WeaWy similar to A46010 X-I;nted 7 y 

135263 A1088775 Hs.55498 geranykjeranyl diphosphate synthase 1 y 
135274 AA448460 Hs.1 12017 GE36ger>e 

135294 M150320 Hs,9800 protein kinase Njmu-R1 y 

5 135295 AI090838 Hs.96006 ESTs y 

135307 AI743770 Hs.98368 ESTs, Weakly similar to KSAA0822 protein y 

135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho yy 

135354 AA456454 Hs. 183418 cell division cycle 2-like 1 (PITSLRE pr y 

135361 AA373452 Hs 167700 Homo sapiens cDNA FU 10174 fts, clone HE y 
10 135389 U05237 Hs 99872 fetal Alzheimer antigen „ 9 

135400 X78592 Hs 99915 androgen receptor (dihydrotestosterone r *y 

302256 AA857131 Hs 171595 HIV TAT specific factor 1 y 

302276 AW057736 Hs. 323910 HER2 receptor tyrosine kinase (c-erW)2, y 

303135 AW592789 Hs 279474 HSPC070 protein y 

15 303686 AK000714 Hs 109441 MSTP033 protein y 

310085 R43191 Hs.101248 Homo sapiens clone I MAGE 32553, mRN A seq y 

315518 AA808229 Hs 167771 ESTs y 

317781 NM_007057Hs 42650 ZWIOInteractor y 

320836 AI268997 Hs 197289 rab3 GTPase-activatjng protein, nor -cata y 

20 321114 AA902256 Hs.78979 Golgi apparatus protein 1 J* 

322221 N24236 Hs 179662 nucleosome assembly protein Mike 1 y 
322474 AF1 18083 Hs 29494 PR019 12 protein 
322556 BE041451 Hs 177507 hypothetical protein 
323541 AF292100 Hs 104613 RP42homolog 
25 407827 BE278431 Hs 40323 BUB3 (budding uninhibited by benzimidazo 

408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22 b ] 

408813 AI580090 Hs 48295 RNA he li case family y 

409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti y 

413670 AB000115 Hs.75470 hypothetical protein, expressed in osteo 2.4 

30 414108 AI267592 Hs.75761 SFRS protein kinase 1 y 

414846 AW304454 Hs.77495 UBX domain-containing 1 ™ 

416980 AA381133 Hs 80684 high-mobility group (nonhistone chromoso yy 

417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate y 

418283 S79895 Hs.83942 cathepsin K (pycnodysostosis) y 

3 5 418467 NM_006910Hs.85273 retinoblastoma-bindlng protein 6 y 

420269 U72937 Hs.96264 alpha thalassemia/me nta I retardation syn y 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen y 

421225 AA463798 Hs.102696 MCT-1 protein y 

421642 AF172066 Hs.106346 retinoic acid represstble protein y 

40 421828 AW891965 Hs.279789 histone deacetylase 3 y 

421983 AI252640 Hs.110364 peptldylprolyl Isomerase C (cyclophilin y 

422052 AA302744 Hs.104518 ESTs y 

422055 NMJ)14320Hs.111029 putative he me-binding protein y 
423750 AF165883 Hs.298229 prefoldin2 

45 424001 W67883 Hs. 137476 paternally expressed 10 (PEG 10; KIAA 105 y 
425182 AF041259 Hs.155040 zinc finger protein 217 
425284 AF155568 Hs.1 55489 NS1 -associated protein 1 
426372 BE304680 Hs.169531 DEAD/H (Asp- Glu-A la-Asp/His) box polypep 
428049 AW183765 Hs.182238 GW128 protein 
5 0 428477 AW500533 Hs.1 1482 splicing factor, argtnineVserine-rich 11 y 
437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep y 
438449 AK001333 Hs.6216 Homo sapiens hepatocellular carcinoma-as y 
441560 F13386 Hs.7888 Homo sapiens clone 23736 mRNA sequence y 
445580 AF167572 Hs.12912 skbl (S. pombe) homolog y 
5 5 446999 AA151520 Hs.334822 hypothetical protein MGC4485 y 
447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal) y 
447778 BE620592 Hs.71190 ESTs, Weakly similar to S 16506 hypotheti y 
448873 NM_003677Hs.22393 density- regulated protein 5 9 

449687 W68520 Hs.331328 ! ntermediate filament protein syncoilin 5 b 

60 450701 H39960 Hs.288467 Homo sapiens cDNA FL J 1 2280 fis, clone MA y 
450703 AA011202 Hs.184771 nuclear factor l/C (CCAAT-bindng transc y 
452461 N78223 Hs.108106 transcription factor 2,9 
452511 BE408178 Hs.285165 Homo sapiens cDNA FLJ20845 fts. clone AD 12.1 
453157 AF077036 Hs31989 DKFZP586G1 722 protein y 
65 453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp 1-3 
100833 AF135168 Hs.108802 N-ethylma)eimide-sensftive factor 3.2 
102481 U50360 gb:Human calcium, calmodulin-dependent p 5-2 
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102827 BE244588 Hs.6456 chaperonin containing TCPl.subuntt 2 (b 7 9 

103549 BE270465 Hs.78793 protein kinase C, zela 2.0 

104331 AB040450 Hs.279862 cdk inhibitor p21 binding protein 53 

110018 AW579842 Hs.104557 hypothetical protein FU 10697 2.0 

115008 AK0O1827 Hs.87889 helicase-moi 5.7 

119075 M10905 Hs. 287820 fibronectin 1 1 -3 

119615 AL034423 Hs.75875 u b iq uitin-conj ugating enzyme E2 variant 2.9 

125006 BE065136 Hs.145696 splicing factor (CC 1.3) 1-7 

127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 2.4 

129209 R62676 Hs.17820 Rho-associated, coiled-coil containing p 5.2 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5 

1301B2 BE267033 Hs.192853 ubiquitin-conjugating enzyme E2G 2 (homo 11.0 

130365 W56119 Hs.1 55103 eukaryotic translation initiation factor 3.3 

131135 NM_016569Hs.267182 TBX3-iso protein 1-3 

131853 A1681917 Hs.3321 ESTs, Highly similar to IRX1.HUMAN IROQU 3.2 

131881 AW351018 Hs.3383 upstream regulatory element binding prot 14.3 

132726 N52298 Hs.55608 hypothetical protein MGC955 3.0 

135193 X95525 Hs.96103 TATA box binding protein (TBP)-assoctate 2.7 

409487 H 19886 gb:yn57a05.r1 Scares adult brain N2b5H 2.3 

416040 AW819158 Hs. 289044 Homo sapiens cDNA FLJ 12048 fis, clone HE 7.4 
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TABLE 4A 



50 



55 



60 



Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



Pkey. 

CAT number. 
Accession: 



Pkey CAT number Accessions 



10 



15 



20 



25 



30 



35 



40 129019 44573.2 



45 



Unique Eos probeset identifier number 
Gene cluster number 
Genbank accession numbers 



123615 
123619 
101445 
124385 
124417 
124482 
102481 
103349 
110856 



120280 
113248 
120472 



30686_-15 

371681.1 

1650_-5 

656394,1 

1642364.1 

1657509.1 

31281_-28 

11052.-2 

19346J4 . 



160212J 
328626J 
44573_2 



120695 9683_3 



M609170 

AA602964 AA609200 
M21259 

AI267847 N27351 
N34059 N46979 
N53935 N53950 
U50360 
X89059 

AA992380 N33063 N21418 H79958 R2191 1 H79957 

103797 109699 1 AA080912 AA075318 AA083403 AA076594 AA078992 AA084926 AA081881 AA113913 M113892 
AA083821 AA134801 M082953 AA070343 AA062835 AA075419 AA063293 AA071252 AA078900 AA062836 AW974305 
AA190577AA181657 

ASmSof R9704O N36809 AB0811 9 AW967677 N35320 AI251473 H59397 AW971 573 R97278 W01059 AW967671 
AA908598 AA251875 A1820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 A1890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW 150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 AA251669 AA251 874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R65056 AI0D2839 R67840 AA300207 AW959581 T63226 F04005 
AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
moms AA251875 AI820501 AK20532 W87891 T85904 U71456 T82391 BE328571 T751Q2 ^/f^^^. 
AI21 9788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 M488964 
S^890387 AI950344 AI741346 At689062 AA282915 AW102898 AI872193 AI763Z73 AW1 73586 AW150329 AI653832 
A1762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538AA1 80009 AA337499 AW951 101 AA251669 AA251874 
AI819225 AW205862 AI683338 AI858509 AW276905 A1633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AIO02839 R67840 AA300207 AW959581 T63226 F04005 

AA976503 AI917B02 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 AW052210 
AA970201 AI633384 AA425910 A1017004 AI241295 AA402816 AA291468 
AA398838AA435847 

AA416568 AA442889 AA417233 AA442223 
AA453641AA454061 



122188 275673.1 

121581 283769.1 

122618 305217 J 

109026 150431J AA157811 AA836869 

123658 genbank.AA609364 AA609364 

123811 genbank JVA620586 AA620586 

125115 genbank_T97341 T97341 

125147 NOTJOUND_entrez_W38150 

118737 382979.1 AA1 99686 N73861 

120274 genbankAA177051 AA177051 

113196 genbank T57317 T57317 

120504 genbankCAA256837 AA256837 

120639 genbankAA286942 AA286942 
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120809 genbanlcAA346495 AA346495 
113702 genbanleT97307 T97307 

129680 23162.1 U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122 
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441 
AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 AA976699 AI687001 
AI621107 AI865540 AA772107 C06286 AA319661 AA405992 

101045 entrezL_J0561 4 J0561 4 

117247 genbank N21032 

110501 genbank H55748 

103392 entrez_X94563X94563 

105032 genbank AA127818 

119513 NOT_FOUND_entrez W37933 

105445 genbank AA252395 

121514 genbank AA412112 

121558 genbank AA412497 

121911 genbank AA427950 

123315 714071J AA4 96369 AA496646 

114911 genbank AA236672 

409487 11 34778 J H 19886 AW402806 T 10231 
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TABLE 5: Figure 5 from BRCA 001 US 
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

1 0 Unigene Title: Unigene gene title 

R1 ; Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD UnigeneTitle R1 

15 

100114 X02308 Hs.82962 thymidyiate synthetase 2.9 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasctclin 7,5 

100154 H60720 Hs.81892 KIAA0101 gene product 9.2 

100335 AW247529 Hs.6793 platelet-activating factor acetylhydrola 27 

20 100666 L05424 Hs.169610 CD44 antigen (homir>g function and Indian 5.7 

100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 9 

100668 L05424 Hs.169610 CD44 antigen (homing function and Indian 7.6 
100678 AW502935 HsJ40 PTK2 protein tyrosine kinase 2 53.2 
100988 AK000405 Hs.76480 ubiquitin-like 4 11.4 

25 101031 J05070 Hs. 151738 matrix metaHoprotei'nase 9 {gelatinase B 8.2 

101045 J05614 gb:Human proliferating cell nuclear anti 5 

101332 J04088 Hs.156346 topoisomerase (DNA) It alpha (170kD) 3.4 

101352 AI494299 Hs.16297 COX17 (yeast) homoJog, cytochrome coxid 6.3 

101580 NM„012151Hs.83363 coagulation factor VII l-associated (in tr 5.7 

30 101592 AF064853 Hs.91299 guanine nucleotide binding protein ( 5.6 

101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4 

101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9 

101810 NM_00O318Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 3.2 

101983 AI904232 Hs. 75323 prohibits 8.4 

35 102107 BE258602 Hs.182366 heat shock protein 75 1.4 

102165 BE313280 Hs.159627 death associated protein 3 4,6 

102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regu 4.3 

102217 AA829978 Hs,301613 JTV1 gene 6 7 

102220 U24389 Hs.65436 lysosomal 4 3 

40 102302 AA306342 Hs.69171 protein kinase (Mike 2 2 7 

102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 2 

102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 

102455 U48705 Hs.75562 discoidin domain receptor family, member 6 9 

102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5 3 

45 102618 AL037672 Hs.81071 extracellular matrix protein 1 5 8 

102687 NM_007019Hs.93002 ubiquitin carrier protein E2-C 4 3 

102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, ty 6 

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 19 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 2 3 
50 102801 BE252241 Hs.3B041 pyridaxal (pyridoxine, vitamin B6) kinas 6.4 

102827 BE244588 Hs.6456 chaperonin containing TCP 1, subunit 2 (b 5 6 

103060 NrvL005940Hs.1 55324 matrix metallo proteinase 11 (M(vP11; stro 4 5 

103080 AU077231 Hs.82932 cydin D1 (PRAD1: parathyroid adenomatos 3.1 

103178 AA205475 Hs.275865 ribosomal protein S18 9,9 

55 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

103238 At 369285 Hs.75189 death-associated protein 5.6 

103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit, 9.7 

103549 BE270465 Hs.78793 protein kinase C, zeta 7.9 

103886 AK001278 Hs.105737 hypothetical protein FU10416 similar to 6.5 

60 104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp 6.3 

1C4827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 10.9 

104846 AI250789 Hs.32478 ESTs 5.6 

104854 AA041276 Hs 154729 3-phosphoinosKide dependent protein kin 12.3 

104867 AA278898 Hs.225979 hypothetical protein similar to small G 2 
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104896 
104909 
104916 
104919 
104974 
104978 
105012 
105039 
105079 
105088 
105393 
105547 
105564 
105658 
105746 
105858 
105930 
106094 
106350 
106359 
106610 
106624 
106713 
106829 
106846 
106873 
106973 
107029 
107197 
107859 
107901 
107922 
107974 
108274 
108647 
108695 
108894 
109011 
109068 
109273 
109468 
110240 
110330 
110501 
110742 
110762 
110856 
110958 
111125 
111179 
111239 
111285 
111392 
111937 
112244 
112995 
113777 
113791 
113811 
113834 
113868 
113870 
113923 
114275 
114895 
114965 
115061 



AW015318 Hs.23165 
AW408164 HS.2491B4 
AW958157 Hs.155489 
AA026880 Hs.25252 
Y12059 Hs.278675 
AI199268 Hs.19322 
AF098158 Hs.9329 
AA907305 Hs.36475 
AA151342 Hs.12677 
H58589 Hs.35156 
AF1 67570 Hs.256583 
AA262640 Hs.27445 
BE616694 Hs.288042 
AA9B5190 Hs.246875 
AW151952 Hs.46679 
AF151066 Hs.281428 
AF016371 Hs.9880 
AA533491 Hs.23317 
AKD01404 Hs.194698 
AW390282 Hs.31130 
AA458882 Hs.79732 
NM_0O3595Hs.26350 
BE614802 Hs.184352 
AW959893 Hs.27099 
AB037744 Hs.34892 
N49809 Hs.11197 
BE156256 Hs.11923 
AF264750 Hs.288971 
W15477 Hs.64639 
AW732573 Hs.47584 
L42612 Hs.335952 
BE153855 Hs.61460 
AW956103 Hs.61712 
AF 129535 Hs^72027 
BE546947 Hs.44276 
AB029000 Hs.70823 
AKO01431 Hs.5105 
AA156542 Hs.72127 
AA164293 Hs.72545 
AA375752 Hs.82719 
NM_015310HS.6763 
AI668594 Hs.1 76588 
AI288666 Hs.16621 
H55748 

AW1 90338 Hs.28029 
BE044245 Hs.30011 
AA992380 

NM_005864Hs.24587 
N63823 Hs.269115 
AK000136 Hs.10760 
N90956 Hs.17230 



ESTs 

transcription factor 19 (SC1) 
NS1 -associated protein 1 
prolactin receptor 
bromodomain-containing 4 



17.7 
5 

1.7 
1.4 

1.4 



AA778711 
W46342 



Hs.4310 
Hs.325081 



BE298665 Hs.14846 

AB029000 Hs.70823 

AA737033 Hs.7155 

BE266947 Hs.10590 

AI269096 Hs.135578 

BE207480 Hs.6994 

T26483 Hs.6059 

W57902 Hs.90744 

AL079314 Hs.16537 

AW953484 Hs.3849 

AW515443 Hs.306117 

AA236177 Hs.76591 

AI733881 Hs.72472 

AI751438 Hs.41271 



Homo sapiens, Similar to RIKEN cDNA 2010 7.2 

chromosome 20 open reading frame 1 3.3 

ESTs 2.5 

CGI-147 protein 9.5 

Homo sapiens cDNA FLJ1 1027 fis, clone PL 2.2 

interleukin enhancer binding factor 3, 9 5.4 

unknown 9.3 

hypothetical protein FU14299 1 .4 

hypothetical protein FU20059 9.4 

hypothetical protein FLJ20739 1 .5 

hypothetical protein 2.9 

peptidyl prolyl isomerase H (cydophilin 5.2 

hypothetical protein FLJ14681 6.8 

cyclin B2 5.7 

transmembrane 7 superfamily member 2 6.3 

fibulin 1 7.9 

tyrosylprotein sulfotransferase 2 7.7 

hypothetical protein FU 12549 4.5 

hypothetical protein FLJ23293 similar to 16.2 

KIAA1323 protein 2.2 
Homo sapiens, clone IMAGE:3343149, mRNA, 16.8 

hypothetical protein 6.6 

myeloid/lymphoid or mixed-lineage leukem 1 .8 

glioma pathogenesis-retated protein 6.1 

potassium voltage-gated channel, delayed 8.4 

keratin 6B 2.5 

Ig superfamily receptor LNIR 2.2 

pyruvate dehydrogenase kinase, isoenzyme 6.7 

F-box only protein 5 7.1 

homeoboxCIO 9-8 

K1AA1 077 protein 7.2 

hypothetical protein FLJ10569 4 

ESTs 1.4 

ESTs 2.9 
Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9 

KIAA0942 protein 3.2 
ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 

DKFZP434I116 protein 6.2 

gbyq94a01 .s1 Soares fetal liver spleen 6.1 

hypothetical protein MGC1 1256 7.6 

hypothetical protein MGC2963 9.3 
gb:ot37g06^1 Soares_testis_NHT Homo sap 2.3 

signal transduction protein (SH3 contain 6.7 
ESTs, Moderately similar to Z195_HUMAN Z 3.6 

asporin (LRR class 1) 7.1 

hypothetical protein FU22087 7.9 

eu ka ryotic translation initiation factor 6 .9 
Homo sapiens, clone IMAGE: 3659680, mRNA, 8.4 
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 10.6 
KIAA1077 protein 14.6 
ESTs, Moderatelysimilarto2115357ATYK 5.6 
zinc finger protein 313 13.4 
ch'rtobiase, di-N -acetyl- 1.3 
Homo sapiens cDNA: FLJ22044 fis, clone H 3.1 
EGF<»ntalning fibulin-like extracellula 11.3 
proteasome (prosome, macropain) 26S subu 2.7 
hypothetical protein, similar to (U06944 6.1 
hypothetical protein FLJ2204 1 similar to 1 .9 
KIAA0306 protein 15.8 
KIAA0887 protein 7.1 
BMP-R1B 2.3 
Homo sapiens mRNA full length insert cDN 1 1 .8 
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115278 
115291 
115652 
115693 
115941 
115968 
116011 
116417 
116470 
116637 
117132 
117881 
118528 
119075 
119265 
119349 
119403 
119789 
120206 
120253 
120297 
120325 
120327 
120349 
120356 
120371 
120383 
120386 
120389 
120396 
120418 
120472 
120484 
120570 
120582 
120596 
120624 
120695 
120713 
120750 
120774 
120807 
120809 
120984 
121081 
121408 
121505 
121508 
121513 
121549 
121558 
121655 
121744 
121748 
121773 
121832 
121839 
121882 
121911 
121999 
122013 
122036 
122356 
122371 
122372 
122460 
122490 



AK002163 
BE545072 
BE093589 
AF231023 
AI867451 
AB037753 
AL359053 
AW499664 
A1272141 
AK001043 
AI393666 
AF161470 
AI949952 
M10905 
BE539706 
T65004 
AL1 17554 
BE393948 
H26735 
AA131376 
AA191384 
AA195651 
AK000292 
AW969481 
AF000545 
AA2 19305 
AL109963 
AW969665 
AW967985 
AA1 34006 
AW966893 
AI950087 
AA253170 
AA280679 
BE244830 
AA282074 
AW407987 
AA976503 
AW449855 
AI191410 
A1608909 
AA346385 
AA346495 
BE262951 
AA398721 
AA406137 
AA494172 
AA402515 
AA416653 
AA412477 
AA412497 
AA421537 
AA398784 
BE536911 
AB033022 
AW340797 
AA425691 
AA426376 
AA427950 
AA430211 
AA431085 
W92142 
AA443794 
AA868555 
AA446008 
AW4 18788 
AA448349 



Hs.301724 hypothetical protein FU113G1 1-5 

Hs.122579 hypothetcal protein FLJ10461 6.2 

Hs.38178 hypothetical protein FU23468 10.6 

Hs.55173 cadherin, EGF LAG seven-pass G-type rece 6 8 

Hs.46679 hypothetical protein FLJ20739 5,5 

Hs.62767 K1AA1332 protein 9.8 

Hs.57664 Homo sapiens mRNA full length insert cDN 2 4 

Hs. 12484 Human clone 23826 mRNA sequence 7 4 

Hs.83484 SRY (sex determining region Y)-box 4 2 1 

Hs.92033 integrin-linked kinasenassociated serine 2.7 

Hs.42315 p10-binding protein 5 2 

Hs.260622 butyrate -induced transcript 1 5 7 

Hs. 49397 ESTs 7 4 

Hs.287820 fibranectin 1 5.7 

Hs.285363 ESTs 1.4 

Hs. 163561 ESTs 8.4 

Hs. 119908 nucleolar protein NOP5/NOP58 6 7 

Hs.50915 kalfikreln 5 (KLK5; KLK-L2; stratum com 9.2 

Hs.91668 Homo sapiens clone PP1498 unknown mRNA 457 

Hs.326401 fibroblast growth factor 12B 38.9 

Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 15.2 

Hs.104106 ESTs 6.4 

Hs.278732 hypothetical protein FLJ20285 16.1 

Hs.55189 hypothetical protein 16.8 

Hs.296433 putative purinetgic receptor 28.1 

Hs.104196 EST 12.4 

Hs.123122 FSH primary response (LRPR1, rat) homoto 9 7 

Hs.154848 hypothetical protein DKFZp434D01 27 32.6 

Hs.325572 ESTs, Moderately similar to ALU7JHUMAN A 21.7 

Hs.79306 eukaryotlc translation initiation factor 12.5 

Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (f 11.4 
gb:wq05c02.x1 NCLC6AP_Kid12 Homo sapien 19.4 

Hs.96473 EST 10.4 

Hs.271445 ESTs, Weakly similar to ALU1JHUMAN ALU S 14.4 

Hs.284228 ZNF135-like protein 10.2 

Hs.237323 N-acetylglucosamine-phosphate mutase 7.5 

Hs.173518 M-phase phosphoprotein homotog 52 

gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8 

Hs.96557 Homo sapiens cDNA FLJ12727 fis, done NT 5.9 

Hs.96693 ESTs, Moderately similar to 2109260A B c 7 

Hs.193985 ESTs 7.8 

Hs.30002 SH3-containing protein SH3GLB2; KIAA1 848 6.8 
gb:EST52657 Fetal heart II Homo sapiens 4.4 

Hs.99052 ESTs 5.6 

Hs.1 86749 ESTs, Highly similar to I37550 mismatch 5,4 

Hs.98019 EST 6 

Hs.194417 ESTs 13-1 

Hs.97887 ESTs 28 

Hs.181510 ESTs 6 2 

Hs.98142 EST 7.4 
gb:zt95g12.s1 Soares_testis_NHT Homo sap 2 8 

Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8 

Hs.97514 ESTs 71 

Hs.234545 hypothetical protein NUF2R 19.5 

Hs.158654 KIAA1196 protein 7 9 

Hs.98434 ESTs 5.8 

Hs.191606 ESTs, Highly similar to KIAA1048 protein 5 

Hs.98459 ESTs 5 

qb:zw50f02.s1 Soares_total_fetus_Nb2HF8_ 7 2 

Hs.98668 EST 6.4 

Hs.98706 ESTs 6 5 

Hs.271963 ESTs, Weakly similar to ALU5_HUMAN ALU S 13.1 

Hs.98390 ESTs 7.3 

Hs. 178222 ESTs 5 

Hs.336677 EST 7.6 

Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 9.7 

Hs.238151 EST 6.1 
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122492 
122510 
122530 
122572 
122607 
122614 
122616 
122618 
122622 
122717 
122829 
122838 
122856 
122868 
122907 
123016 
123034 
123136 
123152 
123394 
123466 
123486 
123615 
123735 
123753 
124006 
124385 
124440 
124656 
124683 
124735 
124761 
124768 
124788 
124811 
124812 
124822 
124860 
124903 
124930 
124942 
125051 
125056 
125101 
125115 
125280 
127274 
128528 
128670 
128691 
128772 
128781 
128797 
128868 
128891 
128946 
128975 
128995 
129019 
129076 
129088 
129096 
129198 
129347 
129362 
129372 
129404 



AA448417 Hs.104990 
AA449232 Hs.99195 
AW959741 Hs.40368 
AA452601 Hs.99287 
AA453518 Hs.98023 
AA453630 Hs.99339 
AA453638 Hs. 161 873 
AA453641 

AA453987 Hs.144802 
AA456859 Hs. 178358 
AW204530 Hs.99500 
AA460584 Hs.334386 
AI929374 Hs.75367 
AF005216 Hs.115541 
AA470074 Hs. 169896 
AW338067 Hs.323231 
AL359571 Hs.44054 
AW451999 Hs.194024 
AW601773 Hs.270259 
AA731404 Hs.105510 
AA599042 Hs.1 12503 
BE019072 Hs.334802 
AA609170 

NM_013241Hs.95231 
AA609955 Hs.234961 
AI147155 Hs.270016 
AI267847 

AA532519 Hs.129043 
AW297702 Hs.1 02915 
AA381661 Hs.1 19878 
R22952 Hs.268685 
AA374756 Hs.93560 
AW368528 Hs. 100855 



R43543 

R46068 

R47948 

AA418160 

R65763 



Hs.100912 
Hs.288912 
Hs.1 88732 
Hs.86043 
Hs.101477 



AW296713 Hs.221441 
AI076343 Hs.173939 



Hs.268892 
Hs.1 00588 
Hs.1 00592 
Hs.286236 

Hs. 106932 



R99978 
T79956 
T81310 
AI472068 
T97341 
AI123705 
AW966158 Hs 58582 



R39234 

AA975486 

W27939 

BE302795 

N71826 



Hs 251699 
Hs 103441 
Hs 103834 
Hs. 105097 
Hs 105465 



NM_002975Hs 105927 
AA419008 Hs 106730 



F34856 
Y13153 
BE560779 
AI816224 
AI950087 

AW296806 Hs 326234 
AA744610 Hs 194431 



Hs.292457 
Hs.107318 
Hs.284233 
Hs 107747 



AA463189 
N57532 
BE614192 
U30246 



Hs 288906 
Hs 109315 
Hs 279869 
Hs.1 10736 



NM_016039Hs.11O8O3 
AI267700 Hs.317584 



ESTs 5.4 

ESTs 11.2 

adaptor-related protein complex 1 , sigma 1 0.1 

EST 11 

ESTs 615 

EST 10.7 

ESTs 107.3 

gb:zx48e06.s1 Soares_tesfs_NHT Homo sap 31.1 

ESTs 5.6 

ESTs 8.5 

ESTs 81.8 

ESTs 75.3 

Src-like-adapter 5.8 

Janus kinase 2 {a protein tyrosine kinas 5.3 

ESTs 11.5 

Homo sapiens cDNA FLJ1 1946 fis, done HE 2.8 

ninein (GSK3B interacting protein) 8.7 

ESTs 5.1 

ESTs 5.2 

ESTs 3.6 

EST 7.4 

Homo sapiens cDNAFLM 4680 fis, done NT 2.4 

gb;af!2a12.s1 Soares_testis_NHT Homo sap 7.8 

FH1/FH2domainHX>ntaining protein 10 

Huntingtin interacting protein E 30.6 

ESTs 8.1 

gbaq49a10.x1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence from clone 989H1 1 on 7.8 

ESTs 8.3 
ESTs, Weakly similar to M3K9JHUMAN MITOG 7.9 

ESTs ' 11.3 

Homo sapiens mRNAfor KIAA1771 protein, 9 

ESTs 8.1 

Homo sapiens cDNA: FU22726 fis, clone H 5.1 

hypothetical protein FLJ22604 14.2 

ESTs 7.9 

Homo sapiens cDNA FLJ13558 fis, clone PL 6.6 

EST 23.9 

ESTs 32.4 

ESTs, Weakly similar to ALUBJ-fUMAN !!!! 22.8 

ESTs, Moderately similar to B34087 hypot 6.1 

EST 135.3 

ESTs 5.4 

KIAA1 856 protein 5.6 

gb:ye57e05.s1 Scares fetal liver spleen 9.6 

ESTs 8 

Homo sapiens cDNA FLJ12789 fis, clone NT 12.8 

ESTs, Weakly similar to IDN4-GGTR14 [H.s 2.8 

Homo sapiens, Similar to RIKEN cDNA 1700 7.1 

hypothetical protein MGC5576 7.7 

thymidine kinase 1 , soluble 5.3 

small nuclear ribonucteoprotein porypept 53.9 

stem cell growth factor; lymphocyte seer 13.3 

chromosome 22 open reading frame 3 3 
Homo sapiens, clone MGC:16362, mRNA, com 13.3 

kynurenine 3-monooxygenase (kynurenlne 3 7.2 

NICE-5 protein 14 

DKFZP566C243 protein 1.9 
gb:wq05c02.x1 NCLCGAP_Kid12Homosapien 2.9 

ESTs, Highly similar to T46422 hypotheti 5 

palladin 17.1 

WW Domain-Containing Gene 20.9 

K1AA1415 protein 5.8 

melanoma-associated antigen recognised b 7.6 

solute carrier family 1 2 (sodium/potass i 6.7 

CGI-99 protein 2 

ESTs 5 
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129482 AA188185 Hs.289043 
129559 W01296 Hs.11360 
129587 H14718 Hs 11506 
129629 AK000398 Hs 11747 
5 129649 AD0C0092 Hs 16488 
129680 U03749 
129689 AW748482 Hs 77873 
129702 AI304966 Hs 12035 
129720 AA156214 Hs 12152 
10 130010 AA301116 Hs 142838 
130097 AL046962 Hs. 14845 
130135 AA311426 Hs,21635 
130211 NM_Q03358Hs.23703 
130242 X79201 Hs 153221 
15 130359 NNL013449HS.277401 
130365 W56119 Hs.155103 
130448 BE513202 Hs.15589 
130455 D90041 Hs.155956 
130471 AL121438 Hs.183706 
20 130503 BE208491 Hs.295112 
130511 L32137 Hs.1584 
130542 U64675 Hs.179825 
130553 AF062649 Hs.252587 
130556 AI907018 Hs.15977 
25 130567 AA383092 Hs.1608 
130574 AF0832O8 Hs.16178 
130617 M90516 Hs.1674 
130667 BE246961 Hs. 17639 
130693 R68537 Hs.17962 
30 130744 H59696 Hs.18747 
130757 AL036067 Hs.18925 
130880 BE514434 Hs.20830 
130944 BE382657 Hs.21486 
131046 AA321649 Hs.2248 
35 131060 AA194422 Hs.22564 
131099 AL133353 Hsi26581 
131135 NM„015569Hs.267182 
131185 BE280074 Hs.23960 
131225 H62087 Hs.31659 
40 131245 AL080080 Hs.24766 
131283 X80038 Hs.339713 
131569 AL389951 Hs.271623 
131643 AW410601 Hs.30026 
131714 AA642831 Hs.31016 
45 131722 D13757 Hs.311 
131760 X76732 Hs.3164 
131793 AW966127 Hs.32246 
131885 BE502341 Hs.3402 
131900 AA099014 Hs,231029 
50 131905 AA179298 Hs.3439 
131941 BE252983 Hs.35086 
131971 BE567100 Hs.154938 
132180 NM_004460Hs.41 8 
132203 NM„004782Hs.194714 
55 132273 AA227710 Hs.43658 
132288 N36110 Hs.305971 
132294 AB023191 Hs.44131 
132348 AW067708 Hs. 170311 
132370 AW572805 Hs.46645 
60 132384 AA312135 Hs.46967 
132450 AA1 00012 Hs.48827 
132465 AW1 69847 Hs.49169 
132532 AA454132 Hs.5080 
132574 AW631437 Hs.5184 
65 132638 AI796870 Hs.54277 
132718 NM_004600Hs.554 
132726 N52298 Hs.55608 



spindSn 6.7 

hypothetical protein FU 14784 7.5 

Human clone 23589 mRNA sequence 6.8 

hypothetical protein aj20391 3.8 

calreticulin 3.3 
gb:Human chrornogranin A (CHGA) gene, pro 14.1 

B7 homolog 3 2.6 

ESTs, Weakly similar to I38022 hypothet 7.4 

APMCF1 protein 2 

nucleolar phosphoprotein Nopp34 1 .6 

forkhead box 03A 2.8 

tubulin, gamma 1 6.1 
ESTs, Moderately similar to CEGTHUMAN C1.6 

synovial sarcoma, translocated to X chro 5.4 

bromodomain adjacent to zinc finger doma 8.5 

eukaryotic translation initiation factor 1 1 

PPAR binding protein 3.9 

N-acetyltransferase 1 (a ry la mine N-acety 33.6 

adducin 1 (alpha) 2.7 

KIAA0618 gene product 16.1 

cartilage oligomeric matrix protein (pse 6. 1 

RAN binding protein 2-like 1 7.8 

pituitary tumor-transforming 1 1 4.4 

Empirically selected from AFFX single pr 4.7 

replication protein A3 (14kD) 7.9 

apopbsis antagonizing transcription fac 1 .2 

glutamtne-fructose-6-phosphate transamin 12.1 

Homo sapiens ubiquitin protein Ifgase (U 1 3.9 

ESTs 2 

POP7 (processing of precursor, S. cerevi 3.1 

protein x 0001 5.7 

kinestn-flke 2 2.1 

signal transducer and activator of trans 5,4 

small inducible cytokine subfamily B(Cy 7.4 

myosin VI 5.1 

COX1 5 (yeast) homolog, cytochrome c oxid 7 

TBX3-iso protein 3.3 

cyctin B1 5.8 

thyroid hormone receptor-associated prot 7.5 

thioredoxin domain-containing 2.8 
Homo sapiens clone F1 9374 APO E-C2 gene 1 .3 
nucteoporin 50kD 5 

HSPC182 protein 2.9 
putative DNA binding protein 2.9 
phosphoribosyl pyrophosphate amidotransf 3.4 
nudeobindin 2 2.9 
Homo sapiens cDNA FU 1 4656 fc, done NT 7.9 
ESTs 13.7 
Homo sapiens, clone MGC;15961 , mRNA, com 8 J 
stomatin-like 2 11.3 
ubiquitin specific protease 1 2.3 
hypothetical protein MDS025 3.5 
fibroblast activation protein, alpha 14.7 
synaptosomal-associated protein. 29kD 7,8 
DKFZP586L151 protein 10 
solute carrier family 2 (facilitated glu 9.2 
KIAA0974 protein 2 
heterogeneous nuclear ribonucleoprolein 12.5 
ESTs 28.3 
HSPC034 protein 6.1 
hypothetical protein FU 12085 8.6 
KIAA1634 protein 6.1 
mitochondrial ribosomal protein L16 7.1 
TH1 drosophila homolog 14 
DNA segment on chromosome X (unique) 99212.4 
Sjogren syndrome antigen A2 (60kD, ribon 3.7 
hypothetical protein MGC955 14.3 
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132731 AJ189075 Hs.301872 hypothetical protein MGC4840 5.9 

132744 AA010233 Hs.55921 g lutamy I -proiyl-tRN A synthetase 6.4 

132773 AA459713 Hs.295901 KIAA0493 protein 14.6 

132798 AI026701 Hs.5716 KIAA0310 gene product 2.5 

5 132810 AB007944 Hs.5737 KIAA0475 gene product 4.2 

132833 U78525 Hs.57783 eukaryotic translation initiation fadof 6.1 

132842 NM 016154Hs.279771 Homo sapiens clone PP1596 unknown mRNA7.1 

132851 U09716 Hs.287912 lectin, manrwse-binding, 1 6.1 

132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary fac 2.7 

10 132941 AI817165 Hs.6120 hypothetical protein FL J 1322 2 2.1 

132972 AA034365 Hs.288924 Homo sapiens cDN A FLJ 11 392 fis, clone HE 3.5 

132980 AA040696 Hs.62016 ESTs 1.3 

132994 AA112748 Hs.279905 clone HQ03 10 PRO0310p1 17.1 

133016 AI439688 Hs.6289 hypothetical protein FLJ20886 4.4 

15 133177 X97795 Hs.66718 RAD54 (S.cerevfeiae)-like 4.4 

133208 AI801777 Hs.6774 ESTs 5.5 

133254 AI567421 Hs.273330 Homo sapiens, done IMAGE:3544662, mRNA, 1.3 

133266 AI160873 Hs.69233 zinc finger protein 16.1 

133268 AW956781 Hs.293937 ESTs, Weakfy similar to FXD2JHUMAN FORKH 12.2 

20 133285 M76477 Hs.289082 GM2 gang lioside acth/ator protein 10.4 

133390 AI950382 Hs.72660 phospha tidy (serine receptor 5.7 

133391 AW103364 Hs.727 inhibin, beta A (actjvin A, activin AB a 25.5 
133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subu 1.7 
133594 AW160781 Hs.172589 nuclear phospho protein similar to S. cer 2.6 

25 133621 NM_004893Hs.75258 H2A histone family, member Y 13.5 

133720 L27841 Hs.75737 pericentriolar material 1 6.7 

133760 BE271766 HsJ 81357 laminin receptor 1 (67kD, ribosomal prot 5.4 

133784 BE622743 Hs.301064 arfaptin 1 12.1 

133791 M34338 Hs.76244 spermidine synthase 9.7 

30 133797 AL1 33921 Hs,76272 retinoblastonn a -binding protein 2 1.3 

133822 D50525 Hs.699 peptidylprolyiisomeraseB (cyctophilin 9.7 

133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 4.2 

133865 AB011155 Hs.170290 discs, large (Drosophi la) homolog 5 5 

133881 U30872 Hs.77204 centromere protein F (350/4O0kD, mitosin 9.1 

35 133924 D86326 Hs.325948 vesicle docking protein p1 15 1.8 

133959 X81789 Hs.77897 splicing factor 3a, subun'rt 3, 60kD 10.4 

133989 AL040328 Hs.78202 SWi/SNF related, matrix associated, act! 2.6 

133997 AI824113 Hs.78281 regulator of G-protein signalling 12 13 

134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, 10.3 

40 134348 AW291946 Hs.82065 interieukin 6 signal transducer (gp1 30, 6.7 

134376 X06560 Hs.82396 2',5'digoadenyl ate synthetase 1 (40-46 5,5 

134379 AW362124 Hs.323193 hypothetical protein MGC3222 5.8 

134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 

134421 AU077196 Hs.82985 collagen, type V, alpha 2 6.7 

45 134480 N(vL005000Hs.83916 Empirically selected from AFFX single pr 6,2 

134516 AK001571 Hs.273357 hypothetical protein aJ 10709 1.4 

1 34529 AW41 1479 Hs.848 FK50 6 -binding protein 4 (59kD) 2.8 

134751 AW630803 Hs.89497 laminBI 6.1 

134790 BE002798 Hs.287850 integral membrane protein 1 1.2 

50 134806 AD001528 Hs.89718 spermine synthase 2.6 

134850 AI701162 Hs.90207 hypothetical protein MGC11 138 9.1 

134859 D26488 Hs.90315 KIAA0007 protein 13.3 

134971 AI097346 Hs.286049 phosphoserine aminotransferase 2 

135181 BE250865 Hs.279529 px19-like protein 14.9 

55 135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUT AT1 17 

135245 AI028767 Hs.262603 ESTs 12.2 

135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X linked 7.6 

135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein 5.8 

135321 AI652069 Hs 98614 ribosome binding protein 1 (dog 180kD ho 12.3 

60 135354 AA456454 Hs. 183418 cell division cycle 2-like 1 (PITSLRE pr 57 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9 

302276 AW057736 Hs. 323910 HE R 2 receptor tyrosine kinase (c 5.3 

317781 NM_007Q57Hs.4265O ZW10 interactor 2.8 

321114 AA902256 Hs.78979 Golgi apparatus protein 1 5.5 

6 5 322556 BE041451 Hs.177507 hypothetical protein 2.9 

420802 U22376 Hs.1334 v-myb avian my e lob tastosis viral oncogen 2.3 

424001 W67883 Hs.137476 paternally expressed 10 (PEG10; KIAA105 7 
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425182 


AF041259 


Hs. 155040 


zinc finger protein 217 


2.3 


446999 


AA151520 


Hs.334822 


hypothetical protein MGC4485 


7.5 


450701 


H39960 


Hs.288467 


Homo sapiens cDNA FU12280 fis, clone MA 5.5 


452461 


N78223 


Hs.108106 


transcription factor 


4.7 


453157 


AF077036 


Hs.31989 


DKFZP586G1722 protein 


12.1 
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TABLE 5A 



Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

123615 30686_-15 AA609170 

1 24385 656394J AI267847 N27351 

1 10856 19346J4 AA992380 N33063 N21418 H79958 R21911 H79957 

120472 44573.2 AI950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 



AA908598 AA25 187 5 A 1820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA4889S4 AA283144 AI890387 AI950344 AI741346 AI689062 AA28291 5 AW1 02898 AI872193 AI763273 AW173586 
AW1 50329 AI653832 AI762688 AA988777 AA488892 AI356394 AW 1038 13 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 



129019 44573_2 AI950087 N70208 R97040 N36B09 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 



AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL1 34043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 
AW1 50329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 
AA251669 AA251874 AI8 19225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N223B8 H84729 H60052 T92487 AKJ22058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W371 81 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

120695 9683_3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW 194691 AI927301 AI740458 
AI796100 AI935603 AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468 



122618 305217J AA4 5364 1 AA45 4061 

125115 genbank JT97341 T97341 

120809 genbank_AA346495 AA346495 

129680 23162J U03749 NM.001275 J03483 J03915 AI214509 AW245744 AL046455 AA31 8960 AI74 1505 AA843875 AI829382 AI560122 



AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 AA405741 AI091 983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA84341 9 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 
AAB94441 AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 
AA976699 AI687001 AI621 107 AI865540 AA772107 C06286 AA319661 AA405992 



101045 
110501 
121558 
121911 



entrez_J05614 J05614 



genbank_H55748 H55748 
genbank_AA412497 AA412497 
genbank_AA427950 AA427950 
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TABLE 6: Figure 6 from BRCA 001 US 
Table 6 shows genes upregulated in tumor tissue compared to 



Pkey; Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title; Unigene gene title 

R 1 : Ratio of tumor to normal breast tissue 

Pkey ExAccn UnigenelD UnlgeneTttle R1 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5 

100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

101806 AA586894 Hs.1 12408 S100 calcium -binding protein A7 (psorias 8.9 

102455 U48705 Hs.75562 discoidin domain receptor family, member 5.9 

103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

105743 BE246502 Hs.9598 semadomari, rmmunog lob uJin domain (Ig), 2.6 

105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5 

106373 AW503807 Hs.21907 histone acetyltransf erase 1.8 
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMANCYTOC 4.2 

119260 AK001724 Hs.102950 coat protein gamma-cop 3.2 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120297 AA191384 Hs.104072 ESTs, Weakly simitar to Z195_HUMAN ZINC 15.2 

120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52 
120695 AA976503 gb:oq30a04.s1 NCf_CGAP_GC4 Homo sapiens 46.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1 848 6.8 

121508 AA402515 Hs.97887 ESTs 28 

122607 AA453518 Hs.98023 ESTs 61.5 

122616 AA453638 Hs.161873 ESTs 107.3 

122618 AA453641 gb:zx48e06.s1 SoaresJestis.NHT Homo sap 31.1 

122829 AW204530 Hs.99500 ESTs 81.8 

122838 AA460584 Hs.334386 ESTs 75.3 

123753 AA609955 Hs.234961 H unting tin interacting protein E 30.6 

124385 AI267B47 gb:aq49a1 0.x 1 Stanley Frontal NB pool 2 57.1 

124860 R65763 Hs,101477 EST 23.9 

124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB.HUMAN !!!! 22.8 

125051 T79956 Hs.100588 EST 135.3 

128781 N71826 Hs.105465 small nuclear ribonucleoprote in polypept 53.9 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6 

129689 AW748482 Hs.77873 B7 homolog 3 2.6 

130503 BE208491 Hs.295112 KIAA0618 gene product 16.1 

130511 L32137 Hs.1584 cartilage oJigomeric matrix protein (pse 61 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4 

131643 AW410601 Hs.30026 HSPC1 82 protein 2 9 

131925 AF151048 Hs.183180 anaphase promoting complex subu nit 1 1 (y 2 7 

1321B0 NM__004460Hs.41B fibroblast activation protein, alpha 14.7 

132370 AW572805 Hs.46645 ESTs 28.3 

132994 AA1 12748 Hs.279905 clone HQ0310 PRO0310p1 17.1 

133016 AJ439688 Hs.6289 hypothetical protein FLJ20886 4.4 

133266 AI160873 Hs.69233 zinc finger protein 16.1 

133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5 

134169 AI690916 Hs.178137 transducer of ERBB2, 1 1.2 

134219 NM_000402Hs.80206 glucose-6 -phosphate dehydrogenase 1.9 

134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 

134529 AW411479 Hs.848 FK506- binding protein 4 (59 kD) 2.8 

134975 R50333 Hs.92186 Leman coiled-coil protein 2.6 

135181 BE250865 Hs.279529 px19-like protein 14.9 

322556 BE041451 Hs.177507 hypothetical protein 2.9 
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TABLE 6A 

Table 6A shows the accession numbers for those pkeys lacking unigeneDD's for Table 6. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 
1 24385 656394J AI267847 N2735 1 

120695 96B3J AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW1 94691 AI927301 AI740458 AI796100 AI935603 

AW052210AA970201 AI633384 AA425910 AI0170O4 AI241295 AA402816 AA291468 
122618 305217J AA453641 AA454061 



150 



WO 02/(159377 



PCT/LS02/02242 



TABLE 7: Figure 7 from BRCA 001-1 US 

5 Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 
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Pkey: 


Unique Eos pro beset identifier number 






ExAccn; 


Exemplar Accession number, Genbank accession number 






UnfcjenelD: Unigene number 








Unigene Title: Unigene gene title 








R1: 


Ratio of tumor to norma! breast tissue 






ORF struct info: Structural characterization of open reading frame for the sequence of the gene 




Pkey 


ExAccn 


UntgenelD 


unigene I ine 




ORF struct info 


100113 NM 001269 


Up 


chromosome condensation 1 


2.3 


TM 


100114 


X02308 




inymiuyiatc bynu>cu»c 


2.9 


other 


100131 


D12485 


US. 1 laOl 


ecto nucleotide pyrophosphatase/phosphodi 


1.9 


other 


100146 


BE1 85499 


Up 

HS./4M 


mamuu/u gene product 


1.9 


TM 


100147 


D1 3666 


nS. i*xw4o 


osteoblast specific factor 2 (fasctclin 


7 fi 


other 


100154 


H60720 


ns.o ioy^ 


MAAUiui gene proouci 


9.2 


other 


100163 


W44671 


Hs. 124 


gene predicted from cDNA with a complete 


1 

t .0 


other 


100220 


AW015534 


Hs.2 17493 


annexin A2 


L 


other 


100255 


D38521 


rib. i ijijtjo 


KIAAnn77 nrntpin 


1.5 


other 


100271 


BE160081 


Hs.256290 


S100 calcium-binding protein A11 (calgiz 


13.5 


other 


100275 


BE242802 


Hs.154797 


KIAA0090 protein 


5.1 


other 


100323 


D50920 


Hs.23106 


KIAA0130 gene product 


1.9 


TM 


100335 


AW247529 


Hs.6793 


platelet-activating factor acetyl hydrola 


2.7 


other 


100364 


NM 004341 


Hs.154868 


carbamoyl-phosphate synthetase 2, aspart 


2 


other 


100372 NM_014791 


Hs.184339 


KiAA0175gene product 


2.6 


other 


100393 


D84145 


Hs.39913 


novel RGD-containing protein 


3.2 


other 


100400 


AW954324 


Hs.75790 


phosphatidytfnosrtoi gfycan, class C 


1.5 


other 


100418 


D86978 


Hs.84790 


KIAA0225 protein 


2 


other 


100482 


M65028 


Hs.81361 


heterogeneous nuclear ribonucleoprotein 


2.9 


other 


100518 


NM 004415 


Hs.74316 


desmopiakin (DPI, DPII) 


1.9 


other 


100666 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


5.7 


other 


100667 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


9 


? 


10066B 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


7.7 


other 


100678 


AW502935 


Hs.740 


PTK2 protein tyrosine kinase 2 


53.2 


other 


100783 


AF078847 


Hs.191356 


general transcription factor IIH, polype 


6 


other 


100892 


BE245294 


Hs.1 80789 


S164 protein 


1.7 


? 


100945 


AF002225 


Hs.1 80686 


ubiquitin protein ligase E3A (human papi 


1.5 


other 


100969 


AA 157634 


Hs.79172 


soiute carrier family 25 (mitochondrial 


6,3 


other 


100988 


AK000405 


Hs.76480 


ubiquitin-like 4 


11.4 


? 


100999 


H38765 


Hs.80706 


diaphorase (NADH/NADPH) (cytochrome b-5 


1.6 


other 


101031 


J05070 


Hs.151738 


matrix metalloproteinase 9 (gelatinase B 


8.4 


other 


101045 


J05614 




gb:Human proliferating cell nuclear anti 


5 


? 


101077 


N99692 


Hs.75227 


Empirically selected from AFFX single pr 


2.6 


other 


101093 


L06419 


Hs.75093 


procollagen-lysine, 2-oxoglutarate 5-dio 


1.4 


? 


101186 


AA020956 


Hs.1 79881 


core-binding factor, beta subunit 


2 


TM 


101216 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK 


1.8 


other 


101228 


AA333387 


Hs.82916 


chaperonin containing TCP1. subunit 6A ( 


1.7 


TM 


101247 


AA1 32666 


Hs.78802 


glycogen synthase kinase 3 beta 


1.9 


other 


101249 L18964 


Hs.1 904 


protein kinase C, iota 


1.5 


other 


101332 


J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD) 


5.3 


other 


101352 


AJ494299 


Hs.1 6297 


COX17 (yeast) homclog, cytochrome coxid 


4.2 


other 


101396 


BE267931 


Hs.78996 


proliferating ceil nuclear antigen 


1.9 


TM 


101445 


M21259 




gbiHuman Alu repeats in the region 5 to 


1.6 


TM 


101470 


NM_000546 


Hs.1846 


tumor protein p53 (Li-Fraumeni syndrome) 


2.5 


other 


101478 


NM_002890 


Hs.758 


RAS p21 protein activator {GTPase activa 


5.5 


other 
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101483 


M24486 


Hs.76768 


procollagen-proline, 2-oxoglutarate 4-di 


2.1 


other 


101540 J04977 


Hs.84981 


X-ray repair complementing defective rep 


1.6 


other 


101573 AW248421 


Hs.250758 


proteasome (prosome, macropain) 26S subu 


5.7 


other 


101580 


NM 012151 


Hs.83363 


coagulation factor V Ill-associated (intr 


1.8 


other 


101592 


AF064853 


Hs.91299 


guanine nucleotide binding protein (G pr 


5.6 


? 


101621 


BE391804 


Hs.62661 


guanytate binding protein 1, interferon- 


2.4 


other 


101702 


AW504089 


Hs, 179574 


protein phosphatase 2 (formerly 2A), reg 


1.3 


other 


101734 


M74099 


Hs.147049 


cut (Drosophila)-like 1 (CCAAT displacem 


2.1 


? 


101759 


M80244 


Hs. 184601 


solute carrier family 7 (cat/on ic amino 


5 


TM 


101767 


M81057 


Hs. 180884 


carboxy peptidase B1 (tissue) 


14.4 


SS, 


101782 


AA306495 


Hs.1869 


phosphogiucomutase 1 


5.2 


other 


101805 


AW409747 


Hs.75612 


stressHnduced-phosphoprotein 1 (Hsp70/H 


8.6 


other 


101806 


AA586894 


Hs.1 12408 


S100 calcium-binding protein A7 (psorias 


8.9 


SS.TM 


101810 


NM 000318 


Hs.180612 


peroxisomal membrane protein 3 (35 kD, Ze 


3.2 


TM 


101879 


AA1 76374 


Hs 243886 


nuclear autoantigenic sperm protein (his 


1.6 


other 


101911 


AA441787 


Hs 119689 


glycoprotein hormones, alpha polypeptide 


31.3 


? 


101920 AF182645 


Hs 8024 


IK cytokine, down -regulator of HLA li 


1.8 


other 


101973 


U41514 


Hs 80120 


UDP-N-acetyl-alpha-D-galactosamine:polyp 


2.4 


other 


102009 


BE245149 


Hs 82643 


protein tyrosine kinase 9 


1.3 


other 


102036 


BE250127 


Hs 82906 


CDC20 (cell division cycle 20, S. cerevi 


2 


? 


102083 


T35901 


Hq 75117 

no. / j i i 1 


intpripnkin pnhanrpr btndina factor 2 4 


1.6 


other 


102107 


BE258602 


He 18236R 


hpat ehoHc nrntpin 75 


1.4 


other 


102123 


NM 001809 


Hs 1594 


centromere protein A (17kD) 


1.8 


other 


102165 


BE3132B0 


He 159627 

(ID. lu J\s£- 1 


death associated protein 3 


4.6 


? 


102198 


AW950852 


uic 745QR 


poiymerase (DNA directed), delta 2, regu 


4.4 


? 


102217 


AA829978 


n o . ju iuij 


ITV1 npnp 

•J I V 1 UtJI IC 


6.7 


other 


102220 


U24389 


He R'id'Vl 


kfenenmal 

IJOUGUI 1 K3l 


4.4 


TM 


102234 


AW1 63390 


He 97fttti 


hptprnrhrnrnatln-likp nrntpin 1 


1.9 


TM 


102260 


AL039104 


no. i 


karyopherin alpha 2 (RAG cohort 1 , intpor 


4.4 


other 


102302 


AA306342 
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Hs.323342 actin related protein 2/3 complex, subun 
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hypothetical protein MGC 10433 


22 


SS, 


132543 


BE568452 


Hs.5101 


protein regulator of cytokinesis 1 


2.2 


other 


132574 


AW631437 


Hs.5184 


TH1 drosophila homolog 


14 


? 


132596 


AK001484 


Hs,5298 


CGI-45 protein 


1.9 


other 


132611 


AA345547 


Hs 53263 


hypothetical protein FLJ 13287 


26 


TM 


132612 


H12751 


Hs.5327 


PR01914 protein 


2 


other 


132616 


BE262677 


Hs. 283558 


hypothetical protein PR01855 


3.1 


other 


13263ft 


AF796870 


Hs.54277 


DNA segment on chromosome X (unique) 992 


12.4 


TM 




AR018319 


Hs 5460 


K1AA0776 protein 


28 


SS, 


1 32692 


AW1 91962 


Hs 249239 


collagen, type VIII, alpha 2 


3 


other 


132715 


F11875 


Hs 5534 


Homo sapiens cDNA FLJ 12961 fis, clone NT 


1.8 


other 


132718 


NM 004600 


Hs554 


Sjogren syndrome antigen A2 (60kD, ribon 


37 


other 


132724 


AI142265 


Hs 55498 


geranylgeranyl diphosphate synthase 1 


1.8 


TM 


132731 


AI189075 

r> I (u Jut u 


Hs301 872 


hypothetical protein MGC 4840 


5.9 


other 


132744 


AA010233 


Hs.55921 


glutamyl-protyl-tRNA synthetase 


8.7 


other 


132760 


AA125QA5 

/Art 1 L-OU\KJ 


Hs.56145 


thymosin, beta, identified in neuroblast 


3.6 


other 


132771 


Y 10275 


Hs.56407 


phosphoserine phosphatase 


2.8 


TM 


132773 


AA459713 

HAT JJI 1 v 


Hs.295901 


K1AA0493 protein 


14.6 


other 


132784 


AI142133 


Hs 56345 


GDP dissociation inhibitor 2 


1,7 


other 


132798 


A 1026701 


Hs ^716 


KIAA0310 gene product 


2.5 


other 


n2RA7 






mutL (E. coli) homolog 1 (colon cancer, 


1.4 


other 


132810 


AR007944 


Hs 5737 


KIAA0475 gene prodxt 


43 


SS, 


1 OtO 10 


□ CO IJQlj 


He 57435 


solute canrier family 11 (proton-coupled 


2.8 


other 


132815 


Alft151ftQ 


Ho. 57475 


sex comb on mid leg homolog 1 


1.6 


other 


1 "*2ft17 


W27ft52 
0J4; 


H« 57553 


tousled-like kinase 2 


1.4 


other 


132821 




Hs 169610 


CD44 antigen (homing function and Indian 


5.4 


other 


1 JIOOO 


U r Ou^J 




eukaryotic translation initiation factor 


6.1 


? 


112fU2 


MM 0161^4 

IHIvl_U ID 1 JH 


Hs 279771 


Homo sapiens clone PP1596 unknown mRNA 


7.2 


other 




r izzuu 


H<! 5811 


chromosome 21 open reading frame 59 


2,9 


other 


1 ^OftCI 
1 OtO 31 


r trvQ7ifi 

UUjI ID 


Hq 2A7Q1 2 


Iprtin mannnQP-hindino 1 


61 


other 




AWQfi^217 


He 203961 


ESTs, Moderately similar to AF116721 89 


1.8 


other 


1 "^ft?^ 
I ozo/o 


r\v vui/f 000 


He S8598 


KIAA1266 protein 


2.2 


other 


132875 


MM 004^50 


Hq 58617 


Rho-associated, coiled-coil containing p 


5 


TM 




RF2fl7141 


Hq 59271 


U2(RNU2) small nuclear RNA auxiliary fac 


2.7 


? 


1 j£03 r 




Hq 59545 


ring finger protein 15 


5.4 


? 


1 oZjuz 




Hq 5QfttA 


hypothetical protein FLJ 10808 


3.2 


other 




AW/ ( CHJ 


Hq 16757ft 


Homo sapiens cDNA FLJ1 1095 fis, clone PL 


1.4 


other 




\AT7ft71A 
W/O/ 


He 60957 


Homo sapiens cDNA FLJ 13598 fis, clone PL 


3 


other 


1 32940 


T79136 


Hs. 127243 


Homo sapiens mRNA for K1AA1724 protein, 


10.3 


other 


132942 


AA554458 


Hs.197751 


KIAA0666 protein 


2.1 


SS, 


132952 


AI658580 


Hs.61426 


Homo sapiens mesenchymal stem cell prote 


1.3 


other 


132962 


AA576635 


Hs.6153 


CG1-48 protein 


4.9 


other 


132972 


AA034365 


Hs.288924 


Homo sapiens cDNA FLJ1 1392 fis, clone HE 


3.6 


TM 


132973 


AA035446 


Hs.323277 


ESTs 


13.1 


other 


132977 


AA093322 


Hs.301404 


RNA binding motif protein 3 


1.3 


other 


132980 


AA040696 


Hs.62016 


ESTs 


2.3 


? 


132994 


AA1 12748 


Hs.279905 


clone HQ0310PRO0310p1 


17.1 


other 


133012 


AA847843 


Hs.62711 


Homo sapiens, clone IMAGE:3351295, mRNA 


1.9 


other 


133015 


AJ002744 


Hs.246315 


UDP-N-acetyl-alpha-D-galactosamine:poryp 


5 


TM 


133062 


AW500374 


Hs .64056 


PRO0149 protein 


6.1 


other 


133069 


BE247441 


Hs.6430 


protein with potyglutamine repeat; calci 


1.5 


TM 


133091 


AK001628 


Hs.64691 


K1AA0483 protein 


1.4 


other 


133110 


AA808177 


Hs.65228 


ESTs 


5.6 


other 


133134 


AF1 98620 


Hs.65648 


RNA binding motif protein 8 A 


1.9 


other 


133145 


H94227 


Hs-6592 


Homo sapiens, clone 1MAGE:2961368, mRNA, 


4.8 


? 


133152 


211695 


Hs.324473 


mitogen-activated protein kinase 1 


5 


other 
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133174 


AA431620 


Hs.324178 


hypothetical protein MGC2745 


2.7 


other 


133175 


AW955632 


Hs.66666 


ESTs, Weakly similar to S 19560 prolines 


9.3 


other 


133177 


X97795 


Hs.66718 


RAD54 (S.cerevisiae)-]ike 


4.5 


TM 


133208 


AI801777 


Hs.6774 


ESTs 


5.5 


TM 


133226 


AW954569 


Hs.296287 


Homo sapiens, Similar to bromodomain-con 


2.7 


other 


133228 


AI492924 


Hs.6831 


golgi phosphoprotein 1 


17 


? 


133254 


A1567421 


Hs.273330 


Homo sapiens, clone )MAGE:3544662, mRNA, 


1.3 


other 


133268 


AW956781 


Hs.293937 


ESTs, Weakly similar to FXD2_HUMAN FORKH 


12.2 


other 


133291 


BE297855 


Hs.69855 


NRAS-related gene 


1.2 


other 


133314 


AA1 02670 


Hs.70725 


gamma-aminobutyric acid (GABA) A recepto 


1.7 


TM 


133321 


T79526 


Hs.179516 


integral type I protein 


11.1 


? 


133327 


AL390127 


Hs.7104 


Kruppel-like factor 13 


2.9 


other 


133347 


BE257758 


Hs.71475 


acid cluster protein 33 


2.5 


? 


133360 


A1016521 


Hs.71816 


v-akt murine thymoma viral oncogene homo 


1.5 


other 


133366 


AA292811 


Ks.72050 


non-metastatic cells 5, protein expresse 


2.1 


other 


133367 


AF231919 


Hs. 18759 


KIAA0539 gene product 


1.3 


other 


133370 


AF245505 


Hs.72157 


DKFZP56411922 protein 


2.2 


other 


133390 


AI950382 


Hs.72660 


phosphatidyteerine receptor 


5.7 


TM 


133391 


AW 103364 


Hs.727 


inhibin, beta A (activin A, activin AB a 


25.5 


other 


133394 


M305127 


Hs.237225 


hypothetical protein HT023 


3.3 


other 


133437 


AL031591 


Hs.7370 


phosphotidylinositol transfer protein, b 


1.6 


other 


133452 


NM 002759 


Hs.274382 


protein kinase, interferon-inducible dou 


4.1 


other 


133453 


AI659306 


Hs.73826 


protein tyrosine phosphatase, non-recept 


1.5 


other 


133500 


AW964804 


Hs.74280 


hypothetical protein FL J 22237 


6.3 


TM 


133529 


W45623 


Hs.74571 


ADP-ribosylation factor 1 


4 


? 


1Vi54^ 

1 OOsWO 


AU077073 


Hs,108327 


damage-specific DNA binding protein 1 (1 


1.8 


? 


1 JvJvF 1 (J 


AU077050 


Hs.75066 


translin 


1.5 


other 


133579 


X75346 


Hs.75074 


mitog en-activated protein kinase-actfvat 


3.5 


TM 


133582 


BE391579 


Hs.75087 


Fas-activated serine/threonine kinase 


6.8 


TM 


133594 


AW1 60781 


Hs.172589 


nuclear phosphoprotein similar to S. cer 


2.6 


TM 


133595 


AA393273 


Hs.75133 


transcription factor 6-like 1 (mitochond 


1.4 


other 


133599 


NNL002885 


Hs.75151 


RAP1 , GTPase activating protein 1 


8.1 


other 


133621 


NM_004893 


Hs.75258 


H2A histone family, member Y 


13.5 


other 


133627 


NM_002O47 


Hs.75280 


glycyl-tRNA synthetase 


2.2 


other 


133631 


NM_000401 


Hs.75334 


exostoses (multiple) 2 


1.8 


other 


133649 


U25849 


Hs.75393 


acid phosphatase 1 , soluble 


2 


other 


133690 


AV661185 


Hs.75574 


mitochondrial ribosomal protein L19 


2.8 


other 


133720 


L27841 


Hs.75737 


pericentric tar material 1 


6.8 


other 


133722 


AW969976 


Hs.279C09 


matrix Gla protein 


2.5 


other 


133751 


AW402048.comp 


Hs.334787 


Homo sapiens, £ 


133757 


T52946 


Hs.196209 


RAE1 (RNA export 1, S.pombe) homolog 


1.4 


? 


1 JO f Uu 


BE271766 


Hs.181357 


laminin receptor 1 (67 kD, ribosomal prot 


5.4 


other 


133765 


M62194 


Hs.75929 


cadherin 11, type 2, OB-cadherin (osteob 


5 


other 


133780 


AA557660 


Hs.76152 


decorin 


3.8 


other 


133797 


AL1 33921 


Hs.76272 


retinoblastoma-binding protein 2 


3.1 


? 


133822 


D50525 


Hs.699 


peptidylprolyl isomerase B (cyclophilin 


9.7 


? 


133842 


AW797468 


Hs.285013 


putative human HLA class II associated p 


2.4 


other 


133845 


AA1 47026 


Hs.76704 


ESTs 


2.5 


other 


1 OOUUJ 


AB011155 


Hs.170290 


discs, large (Drosophila) homofog 5 


5 


other 


133867 


AW340125 


Hs.76989 


KIAA0097 gene product 


2.5 


? 


133868 


AB012193 


Hs.183874 


cullin 4A 


2.1 


other 


133922 


U30825 


Hs.77608 


splicing factor, arginine/serine-rich 9 


2.8 


TM 


133924 


D86326 


Hs.325948 


vesicle docking protein p115 


1.8 


SS, 


133929 


NM_006306 


Hs.211602 


SMC1 (structural maintenance of chromoso 


2 


? 


133936 


L17128 


Hs.77719 


gamma-glutamyl carboxylase 


2.6 


other 


133941 


BE244332 


Hs.77770 


adaptor-related protein complex 3, mu 2 


2.9 


other 


133959 


X81789 


Hs.77897 


splicing factor 3a, subunit 3, 60kD 


10.4 


other 


133976 


AI908165 


Hs.169946 


GATA-binding protein 3 (T-cell receptor 


1.9 


other 


133989 


AL040328 


Hs.78202 


SWI/SNF related, matrix associated, act: 


2.6 


SS, 


133997 


AI824113 


Hs.78281 


regulator of G-protein signalling 12 


13 


other 


134010 


AB016092 


Hs.197114 


RNA binding protein; AT-rich element bin 


8.8 


other 


134015 


D31764 


Hs.278569 


sorting nexin 17 


1.5 


SS, 


134070 


NM 003590 


Hs.78946 


cullin 3 


8.3 


other 


134110 


U41060 


Hs.79136 


LIV-1 protein, estrogen regulated 


2.7 


other 


134129 


NM 014742 


Hs.79305 


KIAA0255 gene product 


42 


other 


134134 


H86504 


Hs.173328 


protein phosphatase 2, regulatory subuni 


1.7 


other 


134200 


BE559598 


Hs.197803 


KIAA0160 protein 


2.6 


other 



3.1 



TM 
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134206 AF 107463 


Hs.79968 


splicing factor 30, survival of motor ne 


1.3 


other 


134219 NM 000402 


Ks.80206 


glucose-6-phosphate dehydrogenase 


1.9 


other 


134234 BE300078 


Hs.80449 


Homo sapiens, clone IMAGE:3535294, mRNA, 


10.3 


S3, 


134275 AI878910 


Hs,3688 


dsplatin resistance-associated overexpr 


2.5 


other 


134292 AI906291 


Hs81234 


immunoglobulin superfamily, members 


1.3 


TM 


134301 AW502505 


Hs 81360 


Homo sapiens cDNA: FU21927 fts, done H 


1.6 


TM 


134305 U61397 


Hs 81424 


ubiqurtin-like 1 (sentrin) 


2.1 


TM 


134324 AB029023 


Hs. 179946 


KIAA1 100 protein 


5.3 


? 


134326 AW903838 


Hs 81800 


chondroirjn sulfate proteoglycan 2 (vers 


2.5 


TM 


134329 N92036 


Hs.81848 


RAD21 (S. pombe) homolog 


3.9 


? 


134337 NM 004922 


Hs.31964 


SEC24 (S. cerevisiae) related gene femit 


2.4 


TM 


134348 AW291946 


Hs 82065 


interieukin 6 signal transducer (gp130, 


6.8 


TM 


134367 AA339449 


Hs 82285 


phosphoribosylgtydnamide formyttransfer 


2.3 


TM 


134376 X06560 


Hs 82396 


Z.S'-oligoadenylate synthetase 1 (40-46 


5.5 


other 


134379 AW362124 


Hs323193 


hypothetical protein MGC3222 


5.9 


TM 


134364 AI589941 


Hs8254 


Homo sapiens, Similar to tumor different 


2.2 


other 


134391 AA417383 


Hs 82582 


integrin, beta-like 1 (with EGF-fike rep 


2.1 


other 


134395 AA456539 


Hs8262 


lysosomal 


2.3 


other 


134405 AW067903 


Hs82772 


collagen, type XI, alpha 1 


72.9 


other 


134411 BE272095 


Hs 167791 


reticulocalbin 1, EF-hartd calcium bind'n 


4.4 


other 


134415 AI750762 


Hs 82911 


protein tyrosine phosphatase type IVA, m 


2.3 


other 


134421 AU077196 


Hs 82985 


collagen, type V, alpha 2 


6.8 


? 


134424 244190 


Hs 83023 


peroxisomal biogenesis factor 11B 


2,4 


other 


134446 AA1 12036 


Hs 83419 


KIAA0252 protein 


29 


other 


134447 M58603 


Hs 83428 


nuclear factor of kappa light polypeptid 


6.7 


other 


134470 X54942 


Hs 83758 


CDC28 protein kinase 2 


2.4 


other 


134480 NM 005000 


Hs 83916 


Empirically selected from AFFX single pr 


63 


? 


134485 X82153 


Hs 33942 


cathepsin K (pycnodysostosts) 


1 9 


other 


134498 AW246273 


Hs 84131 


threonyl-tRNA synthetase 


1.8 


other 


134513 AA425473 


Hs84429 


KIAA0971 protein 


14 


other 


134516 AK001571 


Hs 273357 


hypothetical protein FLJ10709 


14 


other 


134520 BE091005 


Hs 74861 


activated RNA polymerase II transcriptio 


56 


other 


134529 AW411479 


Hs 848 


FK506-binding protein 4 (59kD) 


28 


? 


134577 BE244323 


Hs 85951 


exportin, tRNA (nuclear export receptor 


17 


other 


134582 AA927177 


Hs 86041 


CGG triplet repeat binding protein 1 


1 7 


TM 


134612 AW068223 


Hs.171581 


ubiquitin C-terminal hydrolase UCH37 


21 


other 


134624 AF035119 


Hs8700 


deleted in liver cancer 1 


1 3 


other 


134632 X78520 


Hs174139 


chloride channel 3 


2 1 


? 


134654 AK001741 


Hs8739 


hypothetical protein FU 10879 


23 


other 


134666 BE391929 


Hs8752 


transmembrane protein 4 


4 


other 


134687 U62317 


Hs88251 


arylsulfatase A 


62 


other 


134692 NM 003474 


Hs8850 


a disintegrin and metal lo proteinase doma 


2 


other 


134705 BE161887 


Hs 88799 


anaphase-promoting complex subunit 10 


13 


SS, 


134714 Y14768 


Hs.890 


lysosomal 


72 


? 


134719 AA852985 


Hs 89232 


chromobox homolog 5 (Drosophila HP1 alph 


3 2 


other 


134722 AF129536 


Hs.284226 


F-box only protein 6 


25 


other 


134746 X07871 


HS89476 


CD2 antigen (p50), sheep red blood cell 


5 


other 


134751 AW630803 


Hs 89497 


lamin B1 


61 


other 


134790 BE002798 


Hs 287850 


integral membrane protein 1 


56 


TM 


134834 AW451370 


Hs.8991 


adaptor-related protein complex 1, gamma 


53 


other 


134850 AI701162 


Hs.90207 


hypothetical protein MGC1 1 138 


91 


other 


134853 BE268326 


Hs 90280 


5-aminoimidazoie-4-carboxamide ribonucie 


24 


other 


134880 AI879195 


Hs 90606 


15 kDa selenoprotein 


2.7 


other 


134925 AW885909 


Hs6975 


PR0 1073 protein 


1.5 


other 


134955 AW401361 


HS91773 


protein phosphatase 2 (formerly 2A), cat 


4.9 


other 


134971 AI097346 


Hs 286049 


phosphoserine aminotransferase 


2 


other 


134975 R50333 


Hs 92186 


Leman coiled-coil protein 


26 


TM 


135011 AB037835 


Hs92991 


KIAA1414 protein 


14 


? 


135022 NM_0OO4O8 


Hs.93201 


g !y cerol-3- phosphate dehydrogenase 2 (mi 


16 


? 


135032 AW301984 


Hs.173685 


hypothetical protein FU12619 


1.4 


other 


135077 AW503733 


Hs.9414 


KIAA1488 protein 


1.8 


other 


135083 AB036063 


Hs94262 


p53-inducible ribonudeotKie reductase s 


2.5 


other 


135095 AF027219 


Hs.9443 


zinc finger protein 202 


1.5 


TM 


135096 AA081258 


Hs. 132390 


zinc finger protein 36 (KOX 18) 


2.1 


other 


135153 AI093155 


Hs.95420 


JM27 protein 


4.4 


? 


135181 BE250865 


Hs.279529 


px 19- like protein 


14.9 


? 


135199 AA477514 


Hs.96247 


translin-associated factor X 


1.3 


other 
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135207 N26427 


Hs.9634 


ESTs, Highly similar to C10.HUMAN PUTATI 


1.7 


other 


135214 T78802 


Hs.96560 


hypothetical protein FLJ1 1656 


6.2 


other 


135243 BE463721 


Hs.97101 


putative G protein-coupled receptor 


2.8 


TM 


135245 AI028767 


Hs,262603 


ESTs 


122 


TM 


135257 AW291023 


Hs.97255 


ESTs, Weakly similar to A46010 X-linked 


7.7 


TM 


135263 A1088775 


Hs.55498 


geranylgeranyl diphosphate synthase 1 


1.8 


other 


135274 AA448460 


Hs.112017 


GE36 gene 


4.2 


SS, 


135294 M1 50320 


Hs.9800 


protein kinase N)mu-R1 


1.2 


other 


135295 AI090838 


Hs.98006 


ESTs 


4.9 


other 


135307 AI743770 


Hs.98368 


ESTs, Weakly similar to KIAA0822 protein 


59 


? 


135321 A1652069 


Hs.98614 


ribosome binding protein 1 (dog 180kD ho 


12.3 


TM 


135354 AA45B454 


Hs.183418 


cell division cycle 2-tike 1 (PITSLRE pr 


58 


? 


135361 AA373452 


Hs. 167700 


Homo sapiens cDNA FLJ10174 fe, clone HE 


8.1 


other 


135389 U05237 


Hs.99872 


fetal Alzheimer antigen 


19 


other 


135400 X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


13.9 


TM 


302256 AA857131 


Hs. 171 595 


HIV TAT specific factor 1 


1.6 


other 


302276 AW057736 


Hs.323910 


HER2 receptor tyrosine kinase (cert)-b2, 


5.3 


other 


303135 AW592789 


Hs.279474 


HSPC070 protein 


22 


TM 


303686 AK000714 


Hs 109441 


MSTP033 protein 


1.4 


SS, 


^ 1008.5 R431Q1 


Hs.101248 


Homo sapiens clone IMAGE:32553, mRNA seq 


5.2 


other 


115S1R AA808229 


Hs.167771 


ESTs 


23 


? 


117781 NM 0070^7 


Hs 42650 


ZW10 interactor 


29 


? 


1?nft1fi At9fiR c )Q7 


Hs 197289 


rab3 GTPase-activating protein, non-cata 


2 


other 


191114 AAQ099S6 


We 78979 


Golgi apparatus protein 1 


56 


SS, 


197791 N9491fi 


Hs 17Qfifi? 
nil i f JWt 


niirlpnsnmp assemblv Drotein 1-like 1 


1.4 


? 


199474 AP 118081 


Hs .29494 


PR0 191 2 protein 


13 


other 


199WR RF0414 f i1 


He 177507 

1 to* 1 f t JU/ 


hypothetical protein 


29 


SS, 


^91^41 AP9Q9im 


He m4fiii 

no. iuho I o 


RP49 hnmnkvi 


16 


other 


4H7R97 RF97R411 

4U7 0£/ DCZ/04JI 


Hs 40191 


BUB3 (budding uninhibited by benzimidazo 


1 8 


other 


4nfiiQfi ai ni4*i4a 

HUD 1 J\j rM_Ui?*r^tU 


Hs 43627 


SRY (sex determining region Y)-box 22 


1 6 


other 




He 4R9Q5 
no.40tjj 


RNA hplirasp familv 

IX 1 »/A Llwlll/OOC lalllllj 


62 


other 


40Q17R R71797 


Hs 101R17 
no. iu iu i/ 


ESTs, Weakly similar to T32527 hypothefi 


57 


other 


4flQ4R7 HIQRRfi 




nb*vn57a05 r1 Soarps aduit brain N2b5HB5 


27 


other 


4iiR7n ARnnnn^ 


Hs 75470 


hypothetical protein, expressed in osteo 


26 


? 


4 14 IU0 Mi^o^oyz 


Hs 75761 


^PR^ nrnip'm kinasp 1 
orrvo uiuicbi miicioc. i 


24 


TM 


4 1 4 040 MW0U44D4 


uq 774qj; 


UDA UUIllalt l~<jUMla)l III iy 1 


24 


other 


41Rfl4n AWftlQI^ 


Hs 9RQ044 


Hnmn sflnipns rDNA FLJ1P048 fis rlorte 


23 


other 


/MCQflfi AAIQU'VI 
4ioyou AAOO 1 1 JO 


He RHRR4 


hlnh^mriKDHv nm\ in /nnnhlelnno phmnvien 

1 ItUI 1 MIUUllllj UIUUU U IUI lllloUUI IC \A II Ul 1 rVJOV 


42 


TM 


41717ft QWIW 
41*0/0 nO/iJO 


He R9fl17 


TATA hnv hinHinn nrntpin fTRPWissoriate 
i ni n uua uti lull iy ui uicii i \ i ur j~aoo\j\siai^ 


23.6 


other 


41R9R1 c^QAQ^ 


He R1Q49 


rjithpn<iin K fnvrnnrlvsnstfisis^ 


5 8 


other 


41R4R7 MM OflfiQIO 


Hs R5971 


rptinnhl^stnrrw -hinHinn nmtpin fi 

i eui ruuidoLun iq l/newiii^ ^iviwm u 


13 


other 


49H9flQ 1 I79Q17 
4 zutoy u / «o / 


Hs QR9R4 


alnha thalasspmia/mpntal nptarrl^tinn svn 

alL/lla 11 1 CL laoOC 1 1 HOj 1 1 rCI 1 Ujl IvJLu 1 UC11JU1 1 Oj 1 1 


1.6 


? 


49flRTI9 1 19917R 


Hs.1334 


i/-mvh ?iui3n mvplnhlastrisLs viral onnoaen 


2 3 


? 


49199*i AA4R17QR 


Hs 102696 

f lO* 1 \J£*\Ju\J 


MCT-1 protein 


1.6 


? 


491R49 AF179HRR 
4Z I04Z nT l/^WU 


Hs 106146 

( is. iuu\J*»u 


rptinnir arW rpnrpssihlp nrntpin 


3.5 


other 


4/ IDZO AVVoif IjDj 


Hs 97Q7RQ 


histfinp rip^if^ptvl^sp 1 

1 IKDLUI IC UCCHjiCIJ IOOC O 


5 


other 


491QR1 A!9R9fl4n 


Hs 110364 


peptidyl prolyl isomerase C (cyclophilin 


3.1 


TM 


499fl*;9 AAQH9744 
4ZZU0t /v\OU£f44 


Hs 104518 

113. 1 U*T>J IU 


ESTs 


1.9 


TM 


499fl^ MM ni419fl 
4ZZU30 InIVI UIHO/CU 


Hs 111099 


mitafhyp hpmp-binHlnn nrntpin 

^fUUJllVC 1 ICI 1 f\* LrlllUlllvJ plUKflll 


2.4 


other 


423750 AF1 65883 


Hs.298229 


prefoldin 2 


4.2 


? 


424001 W67883 


Hs. 137476 


paternally expressed 10 (PEG 10; KIAA 105 


7.1 


? 


425182 AF041259 


Hs.155040 


zinc finger protein 217 


2.3 


other 


425284 AF155568 


Hs.155489 


NS1 -associated protein 1 


3.5 


other 


426372 BE304680 


Hs.169531 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 


1.9 


? 


428049 AW183765 


Hs.182238 


GW1 28 protein 


7.6 


7 


428477 AW500533 


Hs. 11482 


splicing factor, arginineyserine-rich 1 1 


1.7 


other 


437562 AB001636 


Hs.5683 


DEAD/H (Asp-Glu-AJa-Asp/H'ts) box polypep 


2.4 


other 


438449 AK001333 


Hs.6216 


Homo sapiens hepatocellular carcinoma-as 


3.8 


other 


441560 F13386 


Hs.7888 


Homo sapiens clone 23736 mRNA sequence 


5.6 


other 


445580 AF167572 


Hs.12912 


skbl (S. pombe) homolog 


2 


TM 


446999 AA151520 


Hs.334822 


hypothetical protein MGC4465 


7.6 


other 


447111 A1017574 


Hs.17409 


cysteine-rich protein 1 (intestinal) 


2.2 


other 


447778 BE620592 


Hs.71190 


ESTs, Weakly similar to S1 6506 hypotheti 


2.9 


other 


448873 NM 003677 


Hs.22393 


density-regulated protein 


1.8 


other 


449687 W68520 


Hs.331328 


intermediate filament protein syncoilin 


5.9 


other 


450701 H39960 


Hs.288467 


Homo sapiens cDNA FU12280 fis, done MA 


5.7 


other 


450703 AA011202 


Hs.184771 


nuclear factor l/C (CCAAT- binding transc 


1.4 


other 
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452461 N78223 


HS.IUolvb 


transcription factor 


& ft 
4.0 


? 


452511 BE408178 


Hs. 2851 bo 


Homo sapiens cdna ri_j/utJ4o ns, cione au 






453157 AF077UJ6 


Hs.31bo9 


Utsrirooovjl / ii proietn 


19 1 


SS TM 


453658 BE5419Q6 


(J« 070 AQ 

HS.87B19 


homo sapiens, clone Mou.z^yz, minna, comp 


A ft 


other 


100685 AA328229 


HS.1o45o2 


ribosoma! protein L24 


1 ft 


TM 


100690 AA3o325o 


Hs.1657 


estrogen receptor 1 


1 


other 


100833 AF135168 


Hs. 108802 


N-ethytmaleimide-sens;tjve factor 


1 1 

1 .J 


otner 


100850 AA836472 


ns.2y7yjy 


cathepsin B 


1 7 


9 


101161 NMJJQ6262 


HS. 37044 


peripherin 


1fi Q 

ID. 3 


other 


102481 U50360 




gtxHuman calcium, cafrrtodul in -dependent p 


i.l 


other 


102831 AA262170 


Hs.80917 


adaptor-related protein complex 3 t sigma 


9 


o 


103549 BE270465 


Hs.78793 


protein kinase C, zeta 


a 
O 


other 


103749 AL135301 


Hs.8768 


hypothetical protein rU lUo4y 


1.0 


other 


104331 AB040450 


Hs. 279862 


cdk inhibitor p21 binding protein 


2 




104532 AI498763 


Hs.203013 


hypotheficaJ protein FIJI 2748 


2.1 


other 


104563 AL117403 


Hs.306189 


DKFZP434F1735 protein 


1 .2 


other 


105032 AA127818 




gbd12a02.s1 Soares_pregnanLutenjs_NbH 


7 




105039 AA907305 


Hs.36475 


ESTs 


2.6 


? 


106531 AA454036 


Hs.8832 


ESTs 


1.6 


other 


106977 AL043152 


Hs.50421 


K1AA0203 gene product 


4.9 


other 


107298 N95657 


Hs.6820 


ESTs, Moderately simitar to Y0J1_CAEEL H 


2.5 


TM 


108717 AA122393 


Hs.70811 


hypothetical protein FLJ20516 


1.3 


other 


110018 AW579842 


Hs. 104557 


hypothetical protein FLJ 10697 


5.3 


TM 


110330 A1288666 


Hs.16621 


DKFZP4341116 protein 


6.3 


other 


111391 NM_003896 


Hs .225939 


siaryrtransferase 9 (CMP-IMeuAdactosylc 


5,1 


no 

oo, 


111392 W46342 


Hs.325081 


Homo sapiens, clone IMAGE:3659680, mRNA, 


8.4 


other 


113554 AW503990 


Hs.142442 


HP1-BP74 


3,7 


TM 


113722 AV653556 


Hs.184411 


abumin 


13 


other 


115008 AK001827 


Hs.87889 


helicase-moi 


2 


other 


115062 AA253314 


Hs.154103 


LIM protein (similar to rat protein kina 


1.5 


other 


115121 A1634549 


Hs.88155 


ESTs 


28 


other 


117881 AF161470 


Hs.260622 


butyrate-induced transcript 1 


58 


TM 


119075 M 10905 


Hs. 287820 


fibronectin 1 


5 7 


other 


119615 AL034423 


Hs.75875 


ubiqu'rtin-conjugating enzyme E2 variant 


1 3 


other 


120253 AA131376 


Hs.326401 


fibroblast growth factor 12B 


38.9 


other 


125006 BE065136 


Hs. 145696 


spicing factor (CC 1.3) 


2.9 


o 
r 


127609 X80031 


Hs.530 


collagen, type IV, alpha 3 (Goodpasture 


1 8 


other 


128868 AA4190O8 


Hs.106730 


chromosome 22 open reading frame 3 


3 


other 


128891 F 34856 


Hs.292457 


Homo sapiens, clone MGC:1 6362, mRNA, com 


13.3 


other 


128959 AI580127 


Hs. 107381 


hypothetical protein FLJ1 1200 


10.9 


other 


129209 R62676 


Hs.17820 


Rho-assodated, coiied-coil containing p 


2 4 


other 


129449 AI096988 


Hs.111554 


ADP-ribosylation factor-like 7 


82 


TM 


129453 AW974265 


Hs.111632 


Lsm3 protein 


3 3 


? 


129629 AK000398 


Hs.11747 


hypothetical protein FLJ20391 


3.9 


other 


129917 M30773 


Hs.278540 


protein phosphatase 3 (formerly 2B), reg 


5.3 


TM 


129922 AF042379 


Hs.13386 


gamrna-tubulin complex protein 2 


46 


other 


129989 AB015856 


Hs.247433 


activating transcription factor 6 


4 


SS, 


130182 BE267033 


Hs.192853 


ubiqurtin-conjugating enzyme E2G 2 (homo 


4.6 


other 


130365 W56119 


Hs.155103 


eukaryottc translation initiation factor 


11 


other 


130471 AL1 21438 


Hs.183706 


aodxin 1 (alpha) 


9 7 


other 


130542 U64675 


Hs.179825 


RAN binding protein 2-like 1 


7.9 


other 


130586 AB007891 


Hs. 16349 


KIAA0431 protein 


5.6 


TM 


130768 AF258627 


Hs.211562 


ATP-binding cassette, sub-family A (ABC1 


5.2 


other 


130992 BE398091 


HsJ4316 


desmoplak'n (DP!, DP1I) 


1.8 


TM 


131047 H23230 


Hs.22481 


ESTs, Moderately similar to A46010 X-lin 


1.7 


? 


131135 NM 016569 


Hs.267182 


TBX3-'so protein 


3.3 


TM 


131339 AF058696 


Hs.25812 


Nijmegen breakage syndrome 1 (nibrin) 


2.6 


other 


131760 X76732 


Hs.3164 


nudeobind'm 2 


2.9 


TM 


131774 BE267158 


Hs.169474 


DKFZP586J01 19 protein 


5.6 


other 


131853 AI681917 


Hs.3321 


ESTs, Highly similar to IRX1_HUMAN IROQU 


1.3 


other 


131881 AW361018 


Hs.3383 


upstream regulatory element binding prot 


3.2 


TM 


131887 W17064 


Hs.332848 


SWI/SNF related, matrix associated, act* 


3.2 


other 


132031 AF193844 


Ks.3758 


COP9 complex subunit 7a 


5.9 


? 


132192 AA206153 


Hs.4209 


mitochondrial ribosoma! protein L37 


2.2 


TM 


132203 NM.004782 


Hs.194714 


synaptosomal-associated protein, 29kD 


7.9 


? 


132240 AB0 18324 


Hs.42676 


KIAA0781 protein 


4.3 


other 


132348 AW067708 


Hs. 170311 


heterogeneous nuclear ribonucleoprotein 


12.5 


other 
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132528 
132571 
132726 
132863 
133016 
133053 
133197 
133240 
133266 
133285 
133383 
133540 
133784 
133791 
133850 
133859 
133881 
134208 
134403 
134724 
134806 
134859 
135193 



T78736 

AW674699 

N52298 

BE26804B 

AI439688 

AI065016 

AI275243 

AK001489 

Al 160873 

M76477 

BE313555 

AL037159 

BE622743 

M34338 

W29092 

U86782 

U30872 

NM_000288 

AA334551 

AF045239 

AD001528 

D26488 

X95525 

AA243007 

T70541 

X57766 

S66431 

AA453483 

R63925 

AA173417 

AA280588 

AA504223 

AA609996 

F02907 

AA480103 

AA024664 

AA251776 

AA399047 

N34059 

U95367 

AA490899 

T54762 

Z41963 

AA521186 

AA400195 

AA045083 

AA099589 

W85712 

W45728 

U61232 

AA425154 

T39176 

AA49600C 

W38150 

T96595 

AA227463 

R46025 

AA233177 

AA338760 

AA412106 

L47276 

D82307 

AA293568 

R37778 

AA250843 

W49521 

D80000 



Hs.50758 SMC4 (structural maintenance of chromoso 7.4 

Hs.5169 suppressor of G2 allele of SKP1, S. cere 6.9 

Hs.55608 hypothetical protein MGC955 14.3 

Hs.236494 RAB10, member RAS oncogene family 10.3 

Hs.6289 hypothetical protein aJ20886 4.4 

Hs.6390 Homo sapiens clone RB3344 PRO0845 mRNA, 1 .8 

Hs.180201 hypothetical protein FLJ20671 1.8 

Hs.242894 ADP-ribosylation factor-like 1 1 .8 

Hs.69233 zinc finger protein 16.1 

Hs.289082 GM2 ganglioside actfvator protein 10.4 

HsJ252 K1AA1224 protein 15 

Hs.74619 proteasome (prosome, macropain) 26S subu 1.7 

Hs.301064 arfaptin 1 12.1 

Hs.76244 spermidine synthase 9.7 

Hs.7678 celular retinoic acid-binding protein 1 4.2 

Hs.178761 26S proteasome -associated pad1 homobg 2.2 

Hs.77204 centromere protein F (350/400 kD, mitosin 9.1 

Hs,79993 peroxisomal biogenesis factor 7 3,2 

Hs.82767 sperm specific antigen 2 1.4 

Hs.321576 ring finger protein 22 1.4 

Hs.89718 spermine synthase 2.6 

Hs.90315 KIAA0007 protein 13.3 

Hs.96103 TATA box binding protein (TBP)-associate 3.1 

ESTs 1.6 

ESTs 2.5 

Human stromelysin-3 mRNA 4.5 

Homo sapiens clone 23592 mRNA sequence 3, 1 

ESTs 4 6 

ESTs 1 4 

ESTs 1 9 

ESTs 2.2 

ESTs Highly similar to CHROMOSOME 2 4 

ESTs Highly similar to Surf-4 protein [M.musculusl 5 5 

ESTs 23 

ESTs Weakly similar to !!!! ALU SUBFAMILY J 2 8 

Human NADH; ubiquinone oxidoreductase subunit 6 2 

ESTs 2 3 

ESTs 2.4 

EST-RC_N34059 3 3 
Human GABA-A receptor pi subunit mRNA complete cds 

- ESTs 3.3 

ESTs 2.9 
Homo sapiens HP protein (HP) mRNA complete cds 1 3 

ESTs 16 

ESTs 1 3 
VITAMIN K-DEPENDENT GAM MA-CAR BOXYLASE 
Homo sapiens mRNA for GDP dissociation inhibitor beta 
ESTs Weakly similar to PROCOLLAGEN ALPHA 2(iV 

ESTs Highly similar to HETEROGENEOUS 3.7 
Human tubulin-folding cofactor E mRNA complete cds 



ESTs 



53 
26 
19 
1.7 
1.8 



ESTs Weakly similar to ZK1058.4 [C.elegans] 
ESTs 

EST-RCJV38150 
EST - RC.T96595 

ESTs Weakty similar to No definition Ine found [C.elegans] 

ESTs 2.8 

ESTs 2 

ESTs 1 3 

ESTs 6.2 

EST-L47276 3 4 
ESTs Weakty similar to TH1 protein [D.melanogaster] 

ESTs 1.5 

ESTs 2.4 

Interferon regulatory factor 5 14.6 

Human prolyl 4-hydroxylase alpha (II) subunit 6.5 

Human mRNA for KIAA0178 gene partial cds 2 

174 



? 

other 
? 

other 
other 

SS.TM 
other 
other 
other 

SS, 
? 

other 
other 
other 

SS, 
other 
other 
other 
other 
other 
other 
other 
other 
? 

SS, 
other 
other 

TM 

other 

other 

other 

other 

? 

TM 

TM 

other 

other 

other 

other 

1.7 
other 
? 
? 

TM 

other 
2.5 
16 
2.6 

other 
2.1 

other 
SS.TM 
SS, 

? 

TM 

I. 9 

SS, 
other 
? 

other 
other 

II. 4 
other 
other 
? 

? 

other 



TM 



other 
TM 
TM 

other 



other 
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R99978 

AA1 95036 

Z38501 

U37547 

AA479961 

X57579 

AA449071 

N51855 

AA421213 

AA355201 

N78717 

N73808 

U86782 

AA234817 

D13666 

AA236177 

U50648 

M28211 

AA446949 

W03C07 

W61011 

W87544 

X02751 

Z14077 

Z38839 

AA410894 

AA504499 



ESTs Weakly similar to line-1 protein ORF2 [H .sapiens] 6.1 
Human Ro/SSA ribonucteoprotein homolog (RoRet 5.3 ? 

ESTs Weakly similar to PROBABLE E5 1 .4 other 

Hunr^lAPhorno^8(M!HB)rnF^Acomp!etecds3 2 other 

ESTs 17 other 

inhibin beta A (activin A activin AB alpha polypeptide) 15.8 

ESTs 1 3 TM 

ESTs Moderately similar to NAD(+) A DP- 1 3 other 

ESTs Weakly similar to F28F8.3 [C.etegans] 3.2 other 

ESTs \2 SS.TM 

H.sapiens mRNA fbrtransl/n 1 5 ? 

ESTs 5 ? 

Human 26 S proteasome-associated pad1 22 other 

ESTs 13 other 

Homo sapiens mRNA for osteoblast specific 7 5 SS t 
ESTs 71 ? 
Protein kinase interferorvinducible double 41 ? 

Homo sapiens GTP-binding protein (RAB4) 2 9 other 

ESTs 2 2 other 

ESTs 12 other 

ESTs 1.2 other 

ESTs 1 2 other 

Neuroblastoma RAS viral (v-ras) oncogene homolog 1 2 ? 

YY1 transcription factor 1 2 other 

ESTs 1.2 ? 

ESTs 1.7 other 

ESTs Highly similar to probable chloride channel 3 [Ksap 1.3 



other 
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TABLE 7A 

Table 7 A shows the accession numbers for those pkeys lacking unigenelD's for Table 7. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

102481 31281.-28 U50360 
20 105032 genbank_AA127818 AA127818 

409487 1 134778J H19886 AW402806 T10231 
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TABLE 8: Figure 8 from BRCA 001-1 US 

Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 



Pkey: 


Unique Eos probeset identifier number 






ExAccn: 


Exemplar Accession number, Genbank accession number 






UnigenelD: Unigene number 








Unigene Title: Unigene gene title 






R1: 


Ratio of tumor to normal body tissue 






R2; 




Ratio of tumor to normal breast tissue 






Pkey 


ExAccn 


ii- in 

UnigenelD 


Unigene Title 


R1 


R2 


100075 


AF152333 




protocadherin gamma subfamily B, 4 


1 


3.8 


100229 


AV652249 


nS.loUlUf 


polymerase (DNA directed), beta 


17 


5.3 


100262 


D38500 


Ue ITIMKfl 


postmetotic segregation increased 2-Jike 


08 


4.8 


100271 


BE160081 


Up otcoon 


S100 calcium-binding protein A11 (calgtz 


32 


2.3 


100355 


AI907114 


Up 

ns.nioo 


squalene epoxidase 


33 


1.4 


100522 


X51501 


Hs 99949 


prolactin-induced protein 


11.9 


0.4 


100552 


AA019521 


Hs 301946 


lysosomal 


38 


1.2 


100599 


X77343 


Hs 334334 


transcription factor AP-2 alpha (activat 


94 


9.4 


100676 


X02761 


Hs 287820 


fibronectin 1 


3 


7.8 


100690 


AA383256 


Hs1657 


estrogen receptor 1 


44 


4.4 


100895 


U01351 


Hs 75772 


nuclear receptor subfamily 3, group C, rn 


1 


3.9 


101046 


K01160 




NM_002122:Homo sapiens major histocompatl 7 


4 


101086 


AA382524 


Hs.250959 


histatin 1 


08 


4.1 


101148 


NMJXJ2923 


Hs78944 


regulator of G-proteln signalling 2, 24k 


12 


12 


101161 


NM 006262 


Hs 37044 


peripherin 


31 


1.1 


101201 


L22524 


Hs2256 


matrix metalloproteinase 7 (MMP7; uterin 


44 


0.6 


101212 


Al 186220 


Hs83164 


collagen, type XV, alpha 1 


31 


3.4 


101441 


AW468397 


Hs 100000 


S100 calcium-binding protein A8 (calgran 


0.9 


4.2 


101447 


M21305 




gb:Human alpha satellite and satellrte 3 


29.9 


0.3 


101469 


AA310162 


Hs 169248 


cytochrome c 


08 


4.9 


101567 


M33552 


Hs.56729 


lysosomal 


1 


5.9 


101600 


BE561617 


Hs 119192 


H2A histone family, member Z 


28 


4 


101624 


M55998 




gb:Human alpha- 1 collagen type I gene, 3 


31 


1.7 


101674 


NK.002291 


Hs 82124 


laminin, beta 1 


1.5 


41 


101861 


AA350659 


Hs 83347 


angichassociated, migratory cell protein 


3.1 


1.4 


101977 


AF112213 


Hs 154062 


putative Rab5-interacfing protein 


13 


6.9 


102193 


AL036335 


Hs 313 


secreted phosphoprotein 1 (osteopontin, 


1.9 


4.9 


102199 


AA334592 


Hs79914 


lumican 


22 


3.8 


102304 


AF015224 


Hs 46452 


mammagbbin 1 


4.2 


0.7 


102345 


NM 003480 


Hs 300946 


Microfibril-associated grycoprotein-2 


11 


4.2 


102457 


NM 001394 


Hs2359 


dual spedftcrty phosphatase 4 


45 


0.5 


102534 


U96759 


Hs. 198307 


von Hippel-Lindau binding protein 1 


1.4 


4.2 


102541 


AI379954 


Hs.79025 


KIAA0096 protein 


09 


3.9 


102827 


BE244588 


Hs6456 


chaperonin containing TCP1, subunit 2 (b 


1.5 


10.9 


102962 


R50032 


Hs. 159253 


collagen, type VI, alpha 2 


2.2 


6.2 


102991 


AW293542 


Hs 75309 


eukaryotic translation elongation factor 


56 


57 


103119 


X63629 


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


37 


0.5 


103175 


X69089 


Hs.79227 


myomesln (M-protein) 2 (165kD) 


1.3 


4 


103286 


D38616 


Hs.54941 


phosphorylase kinase, alpha 2 (liver) 


1.3 


3.8 


103319 


X83492 


Hs.82359 


tumor necrosis factor receptor superfami 


08 


4.6 


103372 


BE536700 


Hs.4888 


seryl-tRNA synthetase 


0.9 


8 


103419 


T34708 


Hs.272927 


Sec23 (S. cerevisiae) homolog A 


1.1 


5.1 


103471 


Y00815 


Hs.75216 


protein tyrosine phosphatase, receptor t 


37 


12 


103546 


Z14244 


Hs.75752 


cytochrome c oxidase subunit VI lb 


0.9 


4.4 
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10165ft 


mm 000088 


his, 172928 collagen, type I, alpha 1 


3.2 


3 






gb:zn13e04.r1 Stratagene hNT neuron (937 


0.9 


10 


103774 


H24185 


Hs 92918 hypothetical protein 


1.9 


15.9 


103821 




Hs.198793 Homo sapiens cDNA; FLJ22463 frs, done H 


1.2 


3.9 


103869 


BE439604 


Hs 24322 ATPase, H+ transporting, lysosomal (vacu 


1,4 


3.9 


103980 


AW1 10242 


Hs.293476 hypothetical protein FKSG44 


16 


4.1 


104054 


AK001913 


Hs 7100 hypothetical protein 


15 


4.3 


104115 


AF1B1810 


Hs.26102 opposite strand to trichorhinophalangea! 


7 


7 


104189 


AB040927 


Hs.301804 KIAA1494 protein 


2 


4.6 


ind91fl 




Hs 15303 KIAA0349 protein 


07 


4.5 


1fW97ft 


AW5A3RQ1 


Hs. 109253 N-terminal acetyltransferase complex ard 


3 3 


3.3 


104295 


AW16557? 


Hs.103657 hypothetical protein PR0221 9 


23 


4.2 


1ftd11Q 

[wo ly 




Hs 9950 Sec61 gamma 


31 


7 






Hs 35380 x 001 protein 


4 


1.3 




vcicni 
AO \D\I 1 


Hs 99949 prolactin-induced protein 


3.8 


0.6 






Hs 54642 methionine adenosyltransferase II, beta 


08 


6.7 




nr\UU I/O ■ 


Hs.106390 Homo sapiens mRNA; cDNA DKF2p586H0924 (f 


1.7 


ILK* DUO 


VAKMR9A 


Hs.11565 RIKENcDNA 2010100012 gene 


2 


7.5 


IfVIKQO 

iwoyz 


AWDJLWOo 


Hs 325820 protease, serine, 23 


19 


7.4 


in/icii 

ILWD 1 J 




Hs 24713 hypothetical protein 


1.1 


6 3 


irw£ifi 


Rft99R9 


Hs 106106 protein kinase (cAMP-dependent, cataiyti 


12 


4 




AWZf UDDD 


Hs.171774 hypothetical protein 


14 


3 9 


in/1700 




Hs 305953 zinc finger protein 83 (HPF1) 


15 


4 2 






Hs 274369 uncharacterized hypothalamus protein HAR 


11 


4 1 


104849 




Hs 241507 ribosomal protein S6 


1.3 


4.6 


104850 


a i i ionic 


Hs 8728 hypothetical protein DKFZp434G171 


12 


3.6 


104852 


ur7fHC/i 
WrUIW 


HS20107 ESTs 


08 


4 2 


104861 


AAOoodoU 


Hs 29759 RNA POLYMERASE I AND TRANSCRIPT RELEASE 


1 7 


104873 


(AfAIQ 11 


Hs 20597 host cell factor homolog 


08 


5.4 


104891 


W44o2d 


Hs 30627 ESTs 


07 


6 8 


104920 


AW9550oy 


Hs 306083 Novel human gene mapping b chomosome 22 


1 3.9 


104926 


BE2988Qo 


Hs 33363 DKFZP434N093 protein 


33 


^ 1 


104952 


AVW6098 


Ha74316 desmoplakin (DPI, DPI I) 


12 


1 7 


104963 


AB0z9u2U 


Hs 173694 KIAA1097 protein 


11 


5 5 


104977 


AI,jy2b4U 


Hs 1 8272 amino add transporter system A1 


32 


1 4 


i Acniri 
1Q5U3U 


bftbloUbi 


Hs 337772 Homo sapiens, Similar to RIKEN cDNA 0610 1 6 


1 1.4 


105035 


M1fi7 cn 


Hs8859 Homo sapiens, Similar to RIKENcDNA 5830 15 


7 2 


lubUoo 


DC/drMIQ 


Hs 9006 VAMP (vesicle- associated membrane protei 


11 


3 5 


105159 


Arl4oz/ f 


Hs 265561 CD2-associa ted protein 


12 


10 


105178 


A A 11 1Q1C 


Hs.21941 AD036 protein 


36 


8 3 


105182 


bt4Uf yon 


Hs 18271 goigi phosphoprotein 3 


17 


6.8 


105274 


AtoiHy^y 


Hs 281866 ATPase, H+ transporting, lysosomal (vacu 


11 


3.7 


105303 


□CI/1 11T7 

BE24oo2/ 


Hs 182526 chromosome 22 open reading frame 5 


15 


4 


lAOl 11 


A IA1 C7flQ 


Hs 172089 Homo sapiens mRNA; cDNA DKFZp586l2022 (f 


1 5 


105426 


W2laV/ 


Hs.23439 ESTs 


43 


2.9 


1U54J2 


W03o1b 


Hs 76698 stress-associated endoplasmic reticulum 


15 


5 


105443 




Hs 12144 KJAA1033 protein 


12 


3.6 


105483 


A 1 A T71C7 


Hs 23458 Homo sapiens cDNA: FLJ23015 fis, clone L 


1.7 


15.8 


105492 


A IQAE71 7 


Hs 289112 CGI-43 protein 


2 


4 8 




Al 017715 


Hs 28785 microfibrillar-associated protein 3 


13 


39 


105539 


AB040884 


Hs 109694 KIAA1451 protein 


27 


11.4 


105594 


AB024334 


Hs 25001 tyrosine 3-monooxygenase/tryptophan 5-mo 


13 


61 


105623 


BE504200 


Hs 30127 hypothetical protein 


17 


45 


105807 


AA788946 


Hs.16869 ESTs, Moderately similar to CA1C RAT COL 


3.9 


24.6 


105812 


BE614149 


Hs.20814 CGI-27 protein 


18 


36 


105823 


AI559444 


Hs 293960 ESTs 


19 


66 


105831 


AA329449 


Hs 247302 twisted gastru la tion 


1.5 


43 


105851 


AI827976 


Hs.24391 hypothetical protein FLJ 1361 2 


38 


1.9 


105879 


BE392914 


Hs 30503 Homo sapiens cDNA FLJ1 1344 fis, clone PL 


1 7 


4 


105918 


AW028485 


Hs 26136 hypothetical protein MGC14156 


17 


74 


105939 


AL137728 


Hs 12258 Homo sapiens mRNA; cDNA DKFZp434B0920 (f 


12 


105941 


AB033075 


Hs. 10669 development and differentiation enhancin 


13 


4.6 


105969 


AB030656 


Hs 17377 coronin, actin-binding protein, 1C 


1 1 


5.9 


105990 


AI690586 


Hs.29403 hypothetical protein FLJ22060 


2 


4.6 


106012 


AI240665 


Hs.8895 ESTs 


4.1 


1.2 


106060 


NMJX01329 


Hs 171391 C-termtnal binding protein 2 


2,6 


7 


106070 


T74445 


Hs.5957 Homo sapiens clone 24416 mRNA sequence 1.4 


10.7 



4.8 



5.1 



14 



3.8 
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106083 H62087 Hs.31559 thyroid hormone receptor-associated p rot 1.5 

106156 AA425414 Hs.33287 nuclear factor l/B 5.4 

106255 BE613206 Hs.279607 calpastatin 1.8 

106414 BE568205 Hs28827 mitogen-actjvated protein kinase kinase 5.1 

5 106538 AK00O274 Hs .278635 HDCrM18P protein 1-2 

106568 AW051564 Hs28285 patched related protein translocated in 1.8 

106574 BE044325 Hs 227280 U6 snRNA-associated Sm-like protein 2.3 

106613 N88604 Hs 30212 thyroid receptor interacting protein 15 12 

106617 H09548 Hs 5367 ESTs, Weakly similar to 1 38022 hypo the ti 0.9 

10 106619 AA459480 Hs 23956 hypothefical protein FLJ20502 1.3 

106701 BE387614 Hs 25797 splicing factor 3b, subun'rt 4, 49 kD 1.6 

106721 AA741038 Hs6670 ESTs 17 

106776 AA206079 Hs5693 hypothetical prote'r FLJ2O420 1 

106866 AA487416 Hs.268231 Homo sapiens cDNA: FLJ23111 fts, done L 1,6 

15 106868 BE1 85536 Hs 301183 molecule possessing ankyrin repeats indu 3 3 

106887 BE503373 Hs 334335 hypothetical protein FLJ 13576 1,4 

106940 T85594 Hs.339808 hypothetical protein FLJ 101 20 3,3 

106968 AF216751 Hs.26813 CDA14 3 

107052 BE391904 Hs. 12482 glycerophosphate O-acy transferase 1.7 

20 107061 BE147611 Hs.6354 stromal cell derived factor receptor 1 1.2 

107149 AI289507 Hs.299883 hypothetical protein FLJ23399 18 

107222 BE172058 Hs.82689 tumor rejection antigen (gp96) 1 12 

107233 BE267795 Hs,22595 hypothetical protein FLJ10637 14 

107295 AA1 86629 Hs.80120 UDP-N-acetyi-alpha-D^atactosamkie:polyp 2.6 

25 107679 AA011510 Hs.60512 ESTs 18 

107914 AA027229 Hs 61329 ESTs, Weakty similar to T 16370 hypotheti 13 

107965 AF109219 Hs. 108787 phosphati'dyiinositol gly can, class N 16 

108033 AW368993 Hs 323748 Homo sapiens clone C DAB P0086 mRN A sequ en 

108060 AA291440 Hs 73149 paired box gene 8 11 

30 108081 AA093668 Hs 28578 musdeblind (DrosophilaHike 0 7 

108137 AI283611 Hs 263479 ESTs, Weakly similar to HMG1_HUMAN HIGH 

108186 AW068579 Hs 7780 Homo sapiens mRNA; cDNA DKFZp564A072 (fr 

108215 AI879238 Hs 299315 collapsin response mediator prote'm-5; C 1 5 

108297 AA333660 Hs 71331 hypothetical protein MGC 5350 15 

35 108339 AW151340 Hs51615 ESTs, Weakly similar to A LU7„HU MAN ALUS 

108371 AA074374 Hs 67639 ESTs 13 

108399 AF086070 Hs237519 EST 1 

108469 AA079487 gb:zm97K)8.s1 Stratagene colon HT29 (937 1.5 

108470 AA079500 gb:zm96h10.s1 Stratagene colon HT29 (937 1 1 
40 108564 M23114 Hsl526 ATPase, Ca++ transporting, cardiac muscl 2 

108641 AA112059 Hs.429 ATP synthase, H+ transporting, mitochond 

108668 AA058522 Hs.185751 ESTs 

108694 AA036725 Hs 61847 ESTs 

108824 AK001332 Hs.44672 hypothetical protein FLJ 10470 

45 108863 AA133456 Hs. 102548 glucocorticoid receptor DNA binding fact 

108893 BE276891 Hs 194691 retinoic acid induced 3 

108992 AA152312 Hs 72047 ESTs 

109072 AI732585 Hs.22394 hypothetical protein FLJ 10893 

109097 AA167512 gb:zp10f12.s1 Stratagene fetal retina 93 

50 109160 BE220601 Hs 301997 hypothetical protein FLJ 1303 3 

109244 BE179030 Hs 64239 Human DNA sequence from clone RP5-1174N9 

109481 AA878923 Hs 289069 hypothetical protein FLJ21016 3.8 

109484 AA366263 Hs 72531 hypothetical protein FLJ 11838 19 

109795 AA173942 Hs 326416 Homo sapiens mRNA; cDNA DKFZp564H 1916 (f 

55 110009 BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 4.6 

110107 AW151660 Hs 31444 ESTs 

110411 AW001579 Hs,9645 Homo sapiens mRNA for KIAA 1741 protein, 

110731 NMJJ14899 Hs,188006 KIAA0878 protein 
110756 N21207 Hs.182999 ESTs 

60 110930 BE242691 Hs.14947 ESTs 

110935 AI753230 Hs.323562 hypothetical protein DKFZp564K142 
111051 AI681293 Hs.12186 hypothetical protein FLJ 22558 
111110 AK001566 Hs.23618 hypothetical protein FLJ 10704 
111356 BE301871 Hs.4867 mannosyl (alpha-1,3-)-glycoprotein beta- 

65 111357 BE314949 Hs.87128 hypothetical protein FLJ23309 

111770 R27975 Hs.269401 ESTs, Moderately similar to S656 57 a ^ ha 
111900 AF131784 Hs.25318 



1 1 
12 
14 
14 
12 
13 
11 
12 
13 
4 



\2 
3.7 
2.8 
16 
3.1 
19 
2 

1.1 
1 

3.3 
1.2 
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3.6 
1.2 
4 

6.1 

59 

54 

112 

36 

4.4 

36 

73 

61 

54 

5.4 

12 

6.3 

IB 

3 

7.6 
4.3 
6.5 
6.9 
3.5 
4.3 
4 

3.5 
3.5 
18 
3.5 
5.6 
12 
3.1 
4.6 
4 

6.3 
3.8 
3.6 
3.6 
4.3 
4.9 
3.5 
3.6 
3.6 
3.5 
4 

3.6 
4.1 
3.5 
5 

6.1 
17 
7.7 
4 

3.7 
7.4 
3.5 
3.3 
3.7 
3.5 
12 
7.5 
4 

3.8 
8.2 
6.1 
5.4 
0.8 
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111903 NM_014906 Hs.166351 KIAA 1072 protein 1 5.4 

111951 NM_014927 Hs 100527 KIAA0902 protein 1 3.8 

112141 AW137198 Hs 278682 Phosphatidylglycerophosphate Synthase 1,4 3.5 

112193 R49499 Hs 138238 ESTs 1.5 3 6 

5 112197 NMJ03655 Hs.5637 ESTs 4.6 2 

112610 AW500106 Hs.23643 serine/threonine protein kinase MASK 3.3 10.5 

112971 Z42387 Hs 83883 transmembrane, prostate androgen induced 3.2 3 

112984 T16971 Hs.289014 ESTs, Weakly similar to A4 3932 mucin 2 p 3.7 10.8 

113056 AF019226 Hs8036 glioblastoma overexpressed 4.5 3 7 

10 113449 AW160683 Hs.158006 hypothetical protein 1.2 4 4 

113497 AF143321 Hs.15572 hypothetical protein IMAGE 109914 0.9 3 6 

113508 AL042936 Hs.211571 holocytochrome c synthase (cytochrome c 1.1 35 

113531 AK001898 Hs.16740 hypothetical protein FLJ 11 036 1.2 3.9 

113604 AI075407 Hs.296083 ESTs, Moderately similar to I54374 gene 1.7 5 3 

15 113674 NM_014214 Hs.5753 inositol(myo)-1(or4)-monophosphatase 2 0.8 6.1 

113841 W30681 Hs 146233 Homo sapiens cDNA: RJ221 30 fis, clone H 1.7 6 2 

113857 AW243158 Hs5297 DKFZP564A241 6 protein . 1.2 4 6 

113931 BE255499 Hs.3496 hypothetical protein MGC1 5749 1.5 4 

113936 W17056 Hs 83623 nuclear receptor subfamily 1, group I, m 3.8 1 

20 113987 AA345519 Hs9641 complement component 1, q subcomponent, 1.2 4.7 

114132 AI342493 Hs.24192 Homo sapiens cONA FU20767 fis, clone CO 0.3 4.3 

114156 BE179882 Hs 336920 glutathione peroxidase 3 (plasma) 1.1 4 3 

114213 N58309 Hs 19575 CGt-1 1 protein 1.6 9 2 

114636 AA075488 gb;zm88d01.s1 Stratagene ovarian cancer 1.6 3 7 

25 114760 AI929382 Hs 252692 hypothetical protein a J20343 1 4 4 

114781 T10446 Hs 95388 ESTs 1 4 3 

114795 AB037858 Hs 173484 hypothetical protein FLJ 10337 1 6 9 2 

114901 AV660012 Hs 196437 hypothetical protein FLJ 10788 1 4 5 2 

115096 AI683069 Hs 175319 ESTs 3.7 1 

30 115518 BE541042 Hs 23240 Homo sapiens cDNA; FU21848 fis, clone H 3.2 4 2 

115646 N36110 Hs 306971 solute carrier family 2 (facilitated glu 15 39 

115764 AW582256 Hs 91011 anterior gradient 2 (Xenepus la e vis) horn 13 59 

115802 AW410233 Hs 206521 YME1 (S.cerevisiae)-like 1 17 6 6 

115994 AB037836 Hs 109315 KIAA1 41 5 protein 1.5 91 

35 116032 BE383668 Hs 42484 hypothetical protein FU 106 18 0 9 4 3 

116046 BE395293 Hs 94491 hypothetical protein RJ20297 1 6 5 5 

116274 AI129767 Hs 182874 guanine nucleotide binding protein {G pr 3 2 24 

116310 Z24854 Hs 42299 ESTs 0 8 4 7 

116356 AI371223 Hs.288671 Homo sapiens cDNA FU 11997 fis, clone HE 2 4 3 9 

40 116429 AF191018 Hs 279923 putative nucleotide binding protein, est 5 5 5 5 

116461 AA313607 Hs 58633 Homo sapiens cDNA: FLJ22145 fis, clone H 5 13 

116470 AI272141 Hs 83484 SRY (sex determining region Y)-box 4 8 7 4 5 

116578 D21262 Hs 75337 nucleolar and coiled-body phosphprotein 3.2 69 

116579 AW888411 Hs 81915 leukemia -associated phosphoprotein p18 ( 3 2 3 
45 116589 AI557212 Hs 17132 ESTs, Moderately similar to I54374 gene 31 8.3 

116786 H25836 Hs 301527 ESTs, Moderately similar to unknown [H.s 3 2 4.5 

117170 N25929 Hs 42500 ADP-ribosylation factor-like 5 7 5.5 

117215 N20066 Hs 133207 PTPRF interacting protein, binding prote 12 6.2 

117280 M18217 Hs 172129 Homo sapiens cDNA; FLJ21409 fis, clone C 4 5 2.4 

50 117576 AI383467 Hs 44597 ESTs 1 4 4.2 

117667 U59305 Hs 44708 Ser-Thr protein kinase related to the my 4 3 0.5 

117881 AF161470 Hs 260622 b utyrate- induced transcript 1 21 5.7 

118336 BE327311 Hs 47166 HT021 3 6 7.7 

118475 N66845 gb:za46c11.s1 Scares fetal liver spleen 4.2 0.5 

55 118493 AL353944 Hs 50115 Homo sapiens mRN A; cDNA DKFZp761J 1112 (f 3.5 3.3 

118505 N67343 gb:yz50b07.s1 Morton Fetal Cochlea Homo 2 1 3.8 

119159 AF142419 Hs.15020 homolog of mouse quaking OKI (KH domain 3 7 1.5 

119307 BEO48061 Hs37054 ephrin-A3 3 1.1 

119355 BE218319 Hs 5807 GTPaseRab14 11 5.6 

60 119462 BE041667 Hs 314544 Homo sapiens cervical cancer suppressor- 14 4.3 

119771 AI905687 Hs.2533 EST 3 2 1 

119940 AL050097 Hs 272531 DKFZP586B031 9 protein- 4 3 0.7 

119943 BE565849 Hs 14158 copine III 3.5 1.9 

120407 AA235207 Hs .250456 hypothetical protein DKFZp762F201 1 1 5 3.7 

65 120493 AW968080 Hs, 152939 Homo sapiens done 24630 mRN A sequence 4 1.4 

120677 AF150208 Hs.108327 damage-specific DNA binding protein 1 (1 1.6 6.8 

120867 AA350781 Hs.96967 ESTs 1-1 3.6 
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121368 

121603 

121723 

122223 

122378 

122946 

123155 

123158 

123327 

123495 

123526 

123533 

123768 

123961 

123999 

124000 

124038 

124059 

124083 

124148 

124153 

124252 

124314 

124375 

124432 

124447 

124539 

124543 

124564 

124574 

124605 

124639 

124659 

124737 

124760 

124763 

124792 

124842 

124940 

124949 

124960 

124995 

125030 

125034 

125058 

125076 

125090 

125103 

125154 

125155 

125159 

125170 

125181 

125193 

125260 

125262 

125272 

125388 

125824 

125852 

125970 

126192 

126469 

126510 

127095 

127524 

128312 



BE262S56 

AA416785 

AA243499 

AF169797 

AB032948 

AI718702 

AF121856 

AF161426 

AA421581 

W28673 

AA608657 

AA608751 

AI932318 

AL050184 

AF084555 

BE563957 

AB037860 

BE387335 

AW195237 

BE300094 

AU077333 

BE613340 

AK001552 

D87454 

N39016 

N48000 

D54120 

AI393320 

H66409 

AL036596 

AA749315 

H60193 

AI680737 

BE270465 

AW408586 

BE410405 

R44357 

R56485 

AF068846 

AI903210 

AL023513 

T52700 

AA610577 

BE548446 

T83731 

AA973971 

T91518 

AA570056 

W38419 

AA837043 

AK000669 

AL020996 

R40815 

W67577 

H05635 

AW884980 

BE612888 

W27235 

Z45258 

AW630088 

AW504721 

AW1 60399 

BE384361 

AA057593 

AA340277 

AI243596 

J04182 



Hs.178292 KIAA0180 protein 1,5 

Hs.249495 heterogeneous nud ear nbonucieoprotein 22 

Hs.104800 hypothetical protein FU10134 3.4 

Hs.27413 adaptor protein containing pH domain, PT 3.9 

Hs.21356 hypothetical protein DKFZp762K201 5 1.4 

Hs,308026 major histocompatibility complex, class 1.4 

Hs, 284 291 sorting nexin 6 1-2 

Hs 218329 hypothetical protein 2.4 

Hs 1 78443 ESTs 0 9 

Hs 106747 serine carboxypeptidase 1 precursor prot 1 3 

gb:ae55d04.s1 Stratagene lung carcinoma 2 1 

gb:ae56h07.s1 Stratagene lung carcinoma 2.1 
Hs 188762 ESTs, Moderately similar to H2BL_HUMAN H 1 ,1 

Hs 21610 DKFZP434 B203 protein 1.1 

Hs.7351 cyclic AMP phosphoprotein, 19 kD 1,4 

Hs,74861 activated RNA polymerase II transcriptio 1.9 

Hs, 173933 nuclear factor l/A 15 

Hs, 283713 ESTs, Weakly simitar to S64054 hypotheti 14.8 

Hs.7734 hypothetical protein FLJ22174 1.2 

Hs .227751 lectin, galactoside-binding, soluble, 1 2.5 

Hs 160483 erythrocyte membrane protein band 7.2 (s 1 

Hs 334725 Homo sapiens, Similar to RIKEN cDNA 9430 1.5 

Hs 215766 GTP-binding protein 18 

Hs. 192966 K1AA0265 protein 11 

Hs.268869 ESTs, Weakly similar to ALUC_HUMAN .'))! 1.3 

gb:yy98e12.s1 Soares_multiple_sclerosisL 2.7 

Hs. 146409 cell division cycle 42 (GTP4>inding prot 21 

Hs.104573 ESTs 1 

Hs.108275 ESTs 14 

Hs.42322 A kinase (PRKA) anchor protein 2 0,7 

Hs.77171 m';nichromosome maintenance deficient (S. 1.1 

Hs.21143 DKFZP586C1 324 protein 1.4 
Hs.289068 Homo sapiens cON A FLJ 11 918 fis, clone HE 1,5 

Hs.78793 protein kinase C, zeta 07 
ESTs, Moderately similar to ALU5JHUMAN A 0.9 
calpain 2, (m/ll) large subunit 13 
hypothetical protein FU20736 1 8 

gb:yg93h09.s1 Scares infant brain 1NIB H 1 
Hs.1 03804 heterogeneous nuclear nbonucieoprotein 3 2 
Hs.336780 tubulin, beta polypeptide 1 

Hs.194766 seizure related gene 6 (rnouse)-like 0 9 

Hs.1 10044 ESTs 09 
Hs.1 87775 ESTs 12 
Hs 5167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr 
Hs 3343 phosphoglycerate dehydrogenase 09 
gb;oq02h08.s1 NCI_CGAP_Lu5 Homo sapiens 
gb:ye20105.s1 Stratagene lung (937210) H 3.2 
Hs 122730 ESTs, Moderately similar to KIAA1215 pro 5.3 
gb:zc78a07.s1 Pancreatic Islet Homo sapi 0.9 
Hs 143669 ESTs 11 
Hs 274428 TRF2 -interacting telomeric RAP1 protein 1.1 
Hs 8518 selenoprotein N 11 
Hs.12396 ESTs, Weakly similar to 2004399A chromos 1 
Hs.84298 CD74 antigen (invariant polypeptide of m 12 
Hs.294030 topoisomerase-related function protein 4 1 
Hs.171957 triple functional domari (PTPRF interact 1,3 
Hs,180224 myosin regulatory light chain 1.1 
Hs.64311 a disinteghn and metalbproteinase doma 1.4 
Hs.286013 short coiled-coil protein 2.4 
Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 
Hs.177516 high density lipoprotein binding protein 19 
Hs.30376 hypothetical protein 1.4 
Hs.182885 ESTs, Weakly similar to JC5024 UDP-ga!ac 2 
Hs.334762 hypothetical protein FLJ 14735 1.3 
Hs.10248 Homo sapiens cDNA FLJ20167 fis, clone CO 1.3 
Hs.94630 ESTs, Moderately similar to T03094 A-kin 4.3 
Hs.150101 lysosomal 15 

181 



Hs.91052 
Hs.76288 
Hs,48712 



4.1 

5.5 

3.2 

3.9 

7.1 

37 

4.9 

3.6 

5.2 

5.1 

5.2 

9.3 

3.6 

3.5 

3.8 

11.2 

4.4 

115 

6.2 

12.7 

4.1 

8.4 

10.2 

4.8 
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liver-specific bHLH-Zip transcription fa 


15 


6,2 


422055 


NMJJ14320 


Hs 


111029 


putative heme-bindlng protein 


2 


11.3 


425815 


R94023 


Hs337531 


ESTs, Moderately similar to 138022 hypot 


1.7 


3.6 


426218 


AF1 19043 


Hs.1 68005 


Homo sapiens cDNA FU 13372 fs, clone PL 


33 


2.8 


427397 


AI929685 


Hs.1 77656 


calmodulin 1 (phosphorylase kinase, deft 


13 


4.7 


427466 AA523543 


Hs.7678 


cellular retinoic acid-binding protein 1 


11 


3.7 


427505 AA361562 


Hs.178761 


26S proteasome-associated pad1 homolog 


32 


2.5 


427723 


AI355260 


Hs279789 


histone deacetylase 3 


28 


22 


428673 AW601325 


Hs.324278 


Homo sapiens mRNA; cDNA DKFZp566M063 (fr 


1.1 


430219 


X99209 


Hs.235887 


HMT1 (hnRNP methyitransferase, S. cerevi 


1 8 


8.8 


430450 


R23553 


Hs.241489 


hypothetical protein 


11 


5.6 


432866 


BE395875 


Hs.279609 


mitochondrial earner homolog 2 


1.5 


6.1 


433423 


BE407127 


Hs.8997 


heat shock 70kD protein 1A 


13 


7.6 


437562 AB001636 


Hs.5683 


DEAD/H (Asp-Glu-Ala-Asp/His) box potypep 


16 


6.5 


437667 


BE616412 


Hs286218 


junctional adhesion molecule 1 


13 


3.5 


437754 


R60366 


Hs.5822 


Homo sapiens cDNA: FLJ22120 fts, clone H 


2 


5.7 


440252 


BE513940 


Hs.6101 


hypothetical protein MGC3178 


1 1 


6.2 


441471 


AL042986 


Hs.7857 


erythrocyte membrane protein band 


05 


3.7 


448292 


BE2B1316 


Hs.47334 


hypothetical protein FU 14495 


25 


4.9 


449404 


H51066 


Hs.23581 


teptin receptor gene-related protein 


1.1 


3.6 


449964 


AW001741 


Hs.273193 


hypothetical protein FLJ 10706 


14 


3.5 


451389 


N73222 


Hs279009 


matrix Gla protein 


4 


11.2 


452685 AI634651 


Hs.30250 


v-maf musculoaponeurotic fibrosarcoma (a 


0.8 


5.6 




RC_H15847_s 






peptidylprolyl isomerase B (cyclophilin B) 


18 


4.8 




RC.W84712 






calumenin 


35 


4.6 




X14008_ma1_f 




fysozyme (renal amyloidosis) 


09 


4.5 




RC H86543 f 






ESTs 


1.8 


6.6 




H07011 






ESTs; Weakly similar to SAS [H.sapiens] 


18 


3.9 




RC_AA164586_s 






ESTs 


6.2 




RC.AA070485 






Homo sapiens clone 23967 


34 


2.6 




RC_H98714_s 






ESTs 


1.6 


3.5 




RC_AA406145_f 






ESTs 


4.6 




AA458584 






SRY (sex determining region Y)-box 4 


34 


0.4 




AA031548 






cell division cycle 42 (GTP-b'mding protein; 25kD) 


3.1 




X02761 






fibronectin 1 


36 


15.2 




RC AA487193 






secreted frizzled-related protein 4 


47 


4 




R25326 






Homo sapiens mRNA for putative vacuolar 


09 


5 




RC AA393805 






ESTs; Weakly similar to (defiine not 


1.1 


8.4 




RC AA449333 






ESTs 


29 


4.6 




RC AA287681 


_s 






ESTs 


1.3 




RC AA490864 






ESTs; Highly similar to heat shock factor 


1.4 


5 




RC C14243J 






ESTs; Highly similar to heat shock factor 


17 


5 




R21443 






ESTs 


16 


37 




RC AA251902 






Homo sapiens lysophospho lipase (LPL1) 


2.2 


3.8 




M2?121_s 






small inducible cytokine A5 (RANTES) 


0.9 


9.9 




C00038_s 






ESTs 


2.8 


4.8 




Y00503 






keratin 19 


3.1 


1.1 




RC_R27006_f 






ESTs 


1.6 


3.7 




RCJVA416886 






ESTs; Weakly similar to predicted using 


3.1 


3.1 




RC_AA460450 




fibroblast growth factor receptor 2 (bacteria- 


1.5 


3.7 




RC.AA488433 






ESTs; Weakly similar to deduced amino acid 


11 


4 




RC_AA278400_f 




Homo sapiens HF 




U28831 






Human protein immuno-reactfve with anti-PTH4.4 


0.6 




RCAA1 99588 






Homo sapiens actin-related protein Arp3 (ARP3) 


1.8 



5.2 



0.8 



3.9 



1.5 



3.6 



4.7 
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AF006082 
RC_H90899 
RC W95070 
RCJ90946J 
D87258 
AA313414_s 
RC_H73484_s 

AFFX-HUMSSGF3A/M97935.3 
AFFX-HUMRGE/M10098_5 
AFFX-M27830.5 
AFFX-M27830.5 
RC_AA063431_f 

RC_T63769J ferritin; light polypeptide 



Homo sapiens actin-related protein Arp2 (ARP2) 
desmoplakin(DPl;DPII) 5.4 
desmoplakin (DPI; DPII) 5 
Human mRNA for KIAA263 gene; complete cds 
protease; serine; 11 (IGF binding) 2.4 
ESTs; Weakfy similar to cDNA EST EMBLT1 157 
ESTs; Weakly similar to similar to Yeast 1,3 



1.1 



ESTs 
1.1 



1.6 
5.5 
2.6 
1.1 
3.5 
1.5 
6.3 
2.3 
7.9 
0.5 
0.6 
0,8 
3.7 



10.9 

3.9 

5.3 

13.5 

7.4 
5.4 
4.1 
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TABLE 8A 



Table 8A shows the accession numbers for those pkeys lacking unigenelD's for Table 8. For 
5 each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

108469 116761.1 AA0794B7 AA128547 AA128291 AA079587 AA079600 
20 125076 190299.1 AA973971 T88817 M253263 

114636 109698.1 AA075488 AA129081 AA074851 AA082852 AA074732 AA084908 AA084751 AA076042 AA131172 AA085374 AA079519 
AA074510 AA1 1 3824 AA1 02437 AA070833 AA070143 AA084693 AA084389 AA076373 AA075492 AA062834 AA034335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069 AA081976 AA080957 AA0831 15 AA070942 AA085296 

123526 genbank_AA608657 AA608657 
25 123533 genbank_AA60875l AA608751 

125090 genbank_T91518 T91518 

125154 genbank_W38419 W38419 

118475 genbank„N66845 N66845 

118505 genbank_N67343 N67343 
30 101046 entrezJ<01160K01160 

129982 221 267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 Z1 4166 Z1 4167 Z14 165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
3 5 AF0621 77 AF062222 AF0621 1 5 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF0621 84 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF17401 2 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF0622B5 L06924 L11699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 Al 268 504 Y 15773 X64239 X62969 U00506 
40 X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 

D83676 AB021529 Z18318 U00488 L01412X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF1 151 30 AF1 151 17 Z92896 1)21254 Z92895 AF1 151 12 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M 17749 X69692 AF 174038 U64478 U64486 AF174098 X99365AF 174077 AF1 74083 AF1 74089 AF015133 
AF127792 AJ006171 U00510 U58 144 AF004323AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
4S Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 

AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF1 91092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 
50 AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 

AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NM.002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T98113 
55 108470 genbank. AA079500 AA079500 

101447 entrezjt121305 M21305 

124447 genbank _N48000 N 48000 

101624 entrez_M55998 M55998 

131791 221_260 X62111 S67984 AJ 131056 Z47243 Z47235 AF062268 Z47237 AJ131058 AF062130 AF062282 X62108 AA385989 AA464794 
60 X69861 AW402964 M90808 Z98735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF174071 U00552 U96288 

AF021989 AF062255 AF174061 S73953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 Z98714 Z98738 
S75886 AJ008175 AJ008186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
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H45128 R86072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72Q43 
H24543 H27636 H27610 T28147 H26496 AW364071 AW364072 H45561 H45556 H42605 AI735017 T47421 R48719 H27570 
H44599 A1459598 H42347 H41938 H24993 AA345888 H22339 A1538691 AJ012264 AA664201 Al 880450 AA3273 10 AI991250 
AI833028 AW001210 A1956075 B30467 AA326915 H41943 AI749266 A 1744441 AA327377 AW512326 AI735170 H01634 
AI587047 AI571623 AA327486 AA327103 AA327195 AA326973 T28143 

124842 217726.1 R 56485 R 37248 R5999 2 

103758 AA0848*74J_at AA084874J 

130064 221J64 X57815 L29157 AA367448 AW328098 AW404536 L29156 L29163 H27683 R83195 AA2 95096 A W3 27822 H25458 AW404692 
X57819 X57823 AW405604 AW404447 Z34914 AW406M2 AA427726 AA604389 AW405606 AW405918 AW4051 17 
AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW375738 X87888 AF124166 AF054638 AJ241406 
U09902 AW405284 M 12025 AA360219 L03631 L03629 AW327727 AF1 94810 AF058076 AF194636 AF063774 AF063737 
AF063755 AF063740 AF063739 AF063738 $73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF19481 1 
AF194809 AF194814 AF194808 AF064505 AF064503 AF063765 AF063757 AF063752 AF0637 16 AF063736 AF194806 
AF058077 AF063747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 
U38589 Z18332 AF060122 AF194807 AF060135 AF064506 AF064504 AF063773 L26540 AF058072 AF060131 AF064500 
AF060136 AF064513 AF194683 AF194711 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916 AJ241405 AF194632 AJ388659 AF194625 274694 AF194588 AF194601 U09901 U09911 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ2431 09 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 D84141 AW405758 AF001788 AF194580 D84143 U76684 AF194593 L03630 
X87892 X91134 U21249 Z46346 AJ 132426 AF 103659 AJ233718 AF021038 AJ233727 H24657 U09882 S75627 AA573599 
AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF1O3708 AF103710AF103706 S62232 AF103645 AF103632 AF1 03647 AF1 03644 AF1 03640 
AF103692 AF093581 AF103620 AW405934 AI445389 AW383753 AA360256 AF099676 H21654 H39501 AI820828 H53689 
W26785 AW384496 AW407708 AA541663 AA911602 AI821461 AA588300 AA327050 H42717 AI951280 AA421322 AI923193 
AA864302 H25133 D87023 J03011 M61771 D87017 AA526855 AA253450 

130232 18831.2 U29453 U06715 W24970 AA584303 T39581 AA1 55603 AA305043 M429426 W05664AA 102382 AA482044 W24487AA3 19060 

T88946 F101O6~AA232161 AA243117 AA158937 AA10O864 

109097 genbank„AA167512 AA167512 
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TABLE 9: Figure 9 from BRCA 001-2 US 

Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Titie: Unigene gene title 



Pkey ExAccn UnigenelD UnigeneTitle 

100690 AA383256 Hs.1657 estrogen receptor 1 

102211 BE314524 Hs,78775 putative transmembrane protein 

103587 BE270266 Hs.82128 5T4 oncofetal tropho blast glycoprotein 

104115 AF183810 Hs.26102 opposite strand to trichorhinophaiangeal syndrome I 

105038 AW503733 Hs.9414 KIAA1488 protein 

105500 AW602166 Hs.222399 CEGP1 protein 

105990 AI690586 Hs.29403 hypothetical protein FU 220 60 

106155 AA425414 Hs.33287 nuclear factor l/B 

106373 AW503807 Hs.21907 histone acety (transferase 

106414 BE568205 Hs.28827 mitogen -activated protein kinase kinase kinase 2 

1 10009 BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal 

111900 AF131784 Hs.25313 Homo sapiens clone 25194 mRNA sequence 

114540 AI904232 Hs.75323 prohibitin 

1 16470 AI272141 Hs.83484 SRY (sex determining region Y)^ox 4 

117280 M18217 Hs.172129 Homo sapiens cDNA: FLJ21409 fis, clone COL03924 

119771 AI905687 Hs.2533 EST 

121723 AA243499 Hs.104800 hypothetical protein FLJ 101 34 

124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothetical protein YGLOSOw 

131148 AW953575 Hs.303125 p53-induced protein PIGPC1 

132371 AA235448 Hs.46677 PRO2O00 prote'n 

134169 AI690916 Hs.178137 transducer of ERBB2, 1 

302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112 

452410 AL133619 Hs.29383 Homo sapiens mRNA; cDNA DKFZp434 
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TABLE 10: Figure 10 from BRCA 001-3 PCX 

Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn; Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 

R2: Ratio of 90* percentile tumor to body 

R3: Ratio of 75 th percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 



Pkey 


ExAccn 


UnigenelD 


Unigene THIe 


R1 


R2 


R3 


R4 


100082 


AA130080 


Hs.4295 


proteasome (prosome, macropain) 26S subu 


4.2 


152 


36 


12.2 


100103 


AA380887 


Hs.5085 


dolichyl-phosphate mannosyltransferase p 


9.8 


123 


13 


5 


100131 


D12485 


Hs.11951 


ectonucleotide pyrophosphatase/phosphodi 


13.2 


244 


19 


9.9 


100147 


D13666 


Hs.136348 


osteoblast specific factor 2 (fasciclin 


157 


1030 


66 


5 


100154 


H60720 


Hs.81892 


KIAA0101 gene product 


4.1 


320 


78 


10.6 


100157 


D14661 


Hs.119 


Wilms' tumour 1 -associating protein 


4.7 


119 


26 


3 


100169 


AL037228 


Hs.82043 


D123 gene product 


5.1 


106 


21 


9.2 


100203 


BE242284 


Hs.172199 


adenylate cyclase 7 


47 


47 


1 


43 


100210 


D26361 


Hs.3104 


KIAAO042 gene product 


4.7 


47 


4 


07 


100219 


AW972300 


Hs.118110 


bone marrow stromal cell antigen 2 


3.8 


350 


93 


1.9 


100234 


D29677 


Hs.3085 


KIAA0054 gene product Helicase 


4.1 


64 


16 


3 


100248 


NM 015156HS.78398 


KIAA0071 protein 


34 


77 


23 


5 9 


100252 


NM_006207Hs.1 70040 


platelet-derived growth factor receptor- 


4.5 


45 


4 


4 


100260 


D38491 


Hs.322478 


KIAA01 17 protein 


5.9 


59 


1 


26 


100279 


D42084 


Hs.82007 


K1AA0094 protein 


3.5 


96 


28 


1,3 


100286 


BE247550 


Hs.86859 


growth factor receptor-bound protein 7 


3,1 


306 


98 


1.5 


100294 


AA331881 


Hs.75454 


peroxiredoxin 3 


12.8 


128 


1 


11.7 


100335 


AW247529 Hs.6793 


platelet-activating factor acetylhydrola 


4.2 


187 


44 


5.4 


100365 


AI878927 


Hs.79284 


mesoderm specific transcript (mouse) horn 


45 


129 


29 


31 


100375 


D80004 


Hs.75909 


KIAA0182 protein 


35 


78 


23 


4 8 


100409 


D86957 


Hs.80712 


KIAA0202 protein 


10.2 


102 


1 


4.8 


100410 


D86961 


Hs.79299 


lipoma HMGIC fusion partner-like 2 


4 


40 


1 


38 


100414 


NM_014735Hs.82292 


KIAA0215 gene product 


32 


32 


2 


29 


100418 


D86978 


Hs.84790 


KIAA0225 protein 


36 


36 


7 


3.2 


100438 


AA013051 


Hs.91417 


topoisomerase (DNA) II binding protein 


56 


76 


14 


2 


100439 


AA347720 


Hs.122669 


KIAA0264 protein 


35 


35 


9 


31 


100448 


AF234887 


Hs.57652 


cadherin, EGF LAG seven-pass G-type rece 


55 


145 


27 


2.2 


100449 


D87470 


Hs.75400 


KIAA0280 protein 


34 


34 


1 


1.2 


100522 


X51501 


Hs.99949 


prolactin-induced protein 


22.7 


760 


34 


1,4 


100552 AA019521 


Hs.301946 


lysosomal 


14.4 


144 


9 


47 


100643 NM.005032Hs.4114 


plastin 3 (T isoform) 


41 


259 


63 


1.9 


100661 


BE623001 


Hs.1 32748 


Homo sapiens ribosomal protein L39 mRNA, 


33 


116 


36 


2.2 


100666 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


85 


85 


1 


3.2 


100667 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


3 


594 


201 


2.3 


100745 


BE207168 


Hs.144630 


nuclear receptor subfamily 2, group F, m 


5 


82 


17 


0.9 


100774 


JOS 581 


Hs.89603 


mucin 1, transmembrane 


35 


37 


11 


2.8 


100783 AF078847 


Hs.191356 


general transcription factor IIH, polype 


9,7 


97 


10 


7.2 


100821 


M26460 




gb:Homo sapiens (done 104) retinoblasto 


3.3 


33 


1 


0.8 


100864 


BE563957 


Hs.74861 


activated RNA polymerase II transcriptio 


3.7 


477 


130 


3.1 


100877 


X80821 


Hs.27973 


KIAA0874 protein 


6.3 


63 


4 


57 


100892 


BE245294 


Hs.180789 


S164 protein 


4.7 


47 


1 

30 


4.2 


101038 


BE297139 


Hs.79411 


replication protein A2 (32kD) 


3.8 


115 


7.1 


101046 


K01160 




NM_002122:Homo sapiens major histocompat 


3.9 


390 


100 


11.1 


101079 


BE264901 


Hs.250502 


carbonic anhydrase VIII 


3.9 


39 


8 


3.6 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



101084 


AW409934 Hs.75528 


nucleolar GTPase 


4.1 


53 


13 


4 


101104 


AW862258 Hs.169266 


neuropeptide Y receptor Y1 


15.3 


153 


1 


14.1 


101185 


NM_001621Hs,170087 


aryl hydrocarbon receptor 


11.3 


113 


8 


3.9 


101188 


L20320 Hs.1 84298 


cyclln-dependent kinase 7 (homolog of Xe 


3.1 


118 


38 


2 


101201 


L22524 Hs.2256 


matrix metalloproteinase 7 (MMP7; uterin 


8.2 


396 


48 


0.9 


101232 


AU077288 Hs.242894 


ADP-ribosylation factor-like 1 


4 


110 


28 


10.7 


101275 


BE545277 Hs.3273 


Ts translation elongation factor, mitoch 
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120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F132 
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120493 AW968080 Hs. 152939 Homo sapiens done 24630 mRNA sequence 

120524 AA261S52 Hs.192905 ESTs 

120554 AA284447 Hs.271887 ESTs 

120562 BE244580 Hs.302267 hypothetical protein FLJ 10330 
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6.1 


61 


1 


5.1 




135172 


AB028956 


Hs.12144 


KIAA1033 protein 


3.4 


88 


26 


1.4 




135242 


AI583187 


Hs.9700 


cydin E1 


3.1 


31 


1 


2.3 




135243 


BE463721 


Hs.97101 


putative G protein -coup led receptor 


3.4 


169 


50 


9.1 




135269 NNL003403HS.97496 


YY1 transcription factor 


3.4 


475 


142 


2.5 




135356 


BE312948 


Hs.18104 


hypothetical protein FLJ 11274 


3.1 


31 


10 


1.7 




135357 


AI565004 


Hs.79572 


cathepsin D (lysosomal aspartyl protease 


4.7 


710 


151 


2.5 




135389 


U05237 


Hs.99872 


fetal Alzheimer antigen 


20.6 


206 


4 


19.1 




135397 


L14922 


Hs.166563 


replication factor C (activator 1)1(14 


3.2 


32 


1 


2.4 




435400 


X78592 


Hs.99915 


aridrogen receptor (dihydrotestosterone r 


3.2 


117 


37 


9.4 






AI471525 


Hs.247486 


ESTs 


3.8 


58 


16 


5.5 






X70683 


Hs.93668 


ESTs 


18 


1047 


596 


1.6 






L14922 


Hs.82128 


5T4 oncofetal trophobiast glycoprotein 


5 


285 


58 


1.2 






M23263 


Hs.904 


amylo-1;6-gfucosidase; 4-alptia-glucanotransferas 


3.1 


31 


1 


2.6 




AI267886 


Hs.148027 


polymerase (RNA) II (DNA directed) polypeptide B 


7.8 


137 


18 


11.9 




AA044840 


Hs. 241 676 


stromal cell-derived factor 1 


4.7 


114 


25 


0.9 






N90960 


Hs.227459 


ESTs; Moderately similar to !!!! ALU SUBFAMILY 


4.7 


151 


32 


9.3 




AA873285 


Hs. 137947 


ESTs 


47 


47 


3 


4.4 






T56679 


Hs.865 


RAP1 A; member of FiAS oncogene family 


4 


40 


1 


3.4 






AA305536 




■EST176522 Colon carcinoma (Caco-2) cell line II 


3.6 


121 


34 


11.8 




AI369384 




arylsuffatase D 


3.5 


113 


33 


1.7 






AA219081 


Hs.242396 


ESTs; Moderately similar to !!!! ALU SUBFAMILY] 


3,4 


107 


32 


9.9 
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TABLE 10A 



Table 10 A shows the accession numbers for those pkeys lacking unigenelD's for Table 10. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey; Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

123619 371681 J AA602964 AA609200 

20 104602 524482 2 H47610R86920 

121581 283769J AA4 16568 AA442889 AA41 7233 AA442223 

123523 genbank_AA608588 AA608588 

100821 tigr_HT4306 M26460 U09116 

125091 genbank_T91518 T91518 

25 125150 NOT_FOUND_entrez_W38240 W38240 

118475 genbank_N66845 N66845 

104787 genbank_AA027317 AA027317 

106055 genbank„AA417034 AA417034 

113702 genbank_T97307 T97307 

30 101046 entrez_K01160 K01160 

101447 entrez J/121305 M21305 

101624 entrez_M55998 M55998 

124677 genbank_R01073 R01073 

110581 genbank_H61560 H61560 

35 119023 genbank_N98488 N98488 

110775 genbank_N22414 N22414 

112092 genbank_R44538 R44538 

112253 genbank_R51818 R51818 

107014 genbank_AA598820 AA598820 

40 114988 genbank„AA251089 AA251089 
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TABLE 11: Figure 11 from BRCA 001-3 PCT 

5 Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90* percentile tumor to normal body 

1 5 R3: Ratio of 75* percentile normal body b tumor 

R4: Ratio of tumor to normal breast tissue 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn UnigenelD Unigene Title 



100131 
100147 
100522 
100666 
101104 
101478 
101724 
101754 
101888 
102165 
102304 
102348 
102457 
102567 
102823 
103557 
103613 
104115 
104667 
104804 
104807 
104896 
104943 
105038 
105329 
105500 
105516 
105730 
106012 
106095 
106155 
107102 
107136 
107151 
107922 
108339 
109112 
109292 
109415 
109912 
110009 
110915 
111164 



D12485 Hs. 11951 
D 13666 Hs 136348 
X51501 Hs 99949 
L05424 Hs.169610 
AW862258 Hs. 169266 
NM_002890Hs.758 
L11690 Hs620 
S70114 Hs 239489 
AL049610 Hs.95243 



BE313280 
AF015224 
U37519 



Hs. 159627 
Hs.46452 
Hs 87539 



NM_001394Hs2359 
U63830 Hs 146847 
D85390 Hs 5057 
AL133415 Hs.297753 
NM_000346Hs.2316 
AF1 83810 Hs 26102 
A1239923 Hs30098 
AI858702 Hs.31803 
Al 139058 Hs 125790 
AW015318 Hs.23165 
AF072873 Hs 114218 
AW503733 Hs9414 
AA234561 Hs 22862 
AW602166 Hs 222399 
AK001269 Hs.3073fl 
AW377314 Hs 5364 
AI240665 Hs 8895 



AF 11 5402 
AA425414 
AB037765 
AV561958 



Hs.11713 
Hs.33287 
Hs 30652 
Hs.8207 



AW378065 Hs8687 
BE153855 Hs 61460 
AW151340 Hs 51615 
AW419196 Hs 257924 
AW975746 Hs 188662 
U80736 Hs.1 10826 
AW390822 Hs 301528 
BE075297 Hs 6614 
BE092285 Hs.29724 
N46180 HS.122489 



ectonudeotide pyrophosphatase/phosphodi 

osteoblast specific factor 2 (fascidin 

prolactin-induced protein 

CD44 antigen (homing function and Indian 

neuropeptide Y receptor Y1 

RAS p21 protein activator (GTPase activa 

bullous pemphigoid antigen 1 (230/240kD) 

TIA1 cytotoxic granuie-assodated RNA-bi 

transcription elongation factor A (SI I )- 

death associated protein 3 

mammaglobin 1 

aldehyde dehydrogenase 3 family, member 

dual specificity phosphatase 4 

TRAF family member-assodated NFKBactiv 

carboxypeptidase D 

vimentin 

SRY (sex determining region Y)-box 9 (ca 
opposite strand to trichortiinophalangeal 
ESTs 

ESTs, Weakly similar to N-WASP [H.sapien 

ieudne-rich repeat-containing 2 

ESTs 

frizzled (Drosophila) homolog 6 

KIAA1468 protein 

ESTs 

CEGP1 protein 
hypothetical protein FLJ10407 
DKFZP564I052 protein 
ESTs 

E74-like factor 5 (ets domain transcript 

nuctear factor i/B 

KIAA1344 protein 

GK001 protein 

ESTs 

Ig superfamily receptor LNIR 

ESTs, Weakly similar to ALU7^HUMAN ALU S 

hypothetical protein FLJ13782 

KIAA1702 protein 

trinucleotide repeat containfng 9 

L-kynurem'ne/alpha-aminoadipate aminotra 

ESTs, Weakly similar to A43932 mucin 2 p 

hypothetical protein FLJ13187 

Homo sapiens cDNA FLJ13289 fis, done OV 

210 



R1 


R2 


R3 


R4 


13.2 


244 


19 


9.9 


15 7 


1030 66 


5 


22 7 


760 


34 


1.4 


8 5 


85 


1 


3.2 


15.3 


153 


1 


14.1 


96 


96 


1 


8.5 


94 


94 


1 


0.3 


89 


89 


5 


8 


73 


73 


1 


5.3 


93 


93 


5 


8 


85 


2058 243 


1.4 


64 


428 


67 


23 


20 2 


202 


5 


13 


82 


82 




68 


5.6 


56 




53 


75 


136 


18 


34 


73 


73 




52 


29 


290 




26.8 


149 


149 




64 


77 


77 




51 


7 


70 




65 


74 


74 




6 


16.2 


162 




42 


55 


55 




52 


28 


131 


47 


39 


254 


508 


20 


3 


83 


83 


3 


1 8 


6.9 


69 


1 


4.4 


212 


212 


6 


17.4 


26.3 


356 


14 


1 


99 


483 


49 


18 


63 


63 


1 


54 


25 


392 


155 


43 


15.6 


156 


7 


10.8 


9 


90 


1 


55 


18.7 


187 


1 


17 


4.1 


334 


82 


34 


7.1 


71 


1 


65 


12,3 


123 


1 


11.3 


14.2 


142 


1 


95 


6.3 


693 


110 


7.2 


20.9 


209 


1 


19.5 


7.7 


77 


1 
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65 



111170 A UTWH Tfi 
1 1 1 1/3 AKUUU1 JO 


ns. lUf ou 


asporin(LRR class 1) 


25 1 


ZOO 


1 O 

12 


C 7 
O.f 


111100 AwnnoriKA 
IliiyU AftUUtUDD 


MS. ID1U40 


hypothet cal protein FLJ11 193 


6.3 


CI 


1 


3.0 




ns.j04wo 


KIAA1866 protein 


3.6 


a no 
4Uz 


1 M 


A Q 

4.y 


11100/ DtJi*ty4y 


Up Q71 OQ 

Hs.oY izo 


hypothetical protein FLJ23309 


3.8 


425 


111 


4 




nS./UoZJ 


KIAA1077 protein 


57 


DOf 


inn 




111A>17 AIC71G/tn 


U. 7CXQ 


ESTs 


9.6 


124 


1 O 
1 J 


9 


111 7f\0 TQ7 1A7 




gb:ye53h05.s1 Soares fetal to spleen 


123 




1 1 


117 

n./ 


1141/'+ W0/DD4 


ns.izouiy 


lymphoid nuclear protein (LAF-4) mRNA 


24.2 


OXI 


1 (\ 

1U 


b.b 


Imoo AWJtwyj 


nS.lDi^U 


Homo sapiens mRNA; cDNA DKFZp434E033 (fr 67 


C7 
Of 


4 

1 


0.0 


114/00 ArZl/!o4o 


Lie 1Q001Q 


ets homologous factor 


137 


137 


1 


8.9 


■H/IOOTl At 1C7Cj1C 


HS.421/9 


bromodomain and PHD finger containing, 3 


91 


91 


1 


7 6 


■M^flCf A 1703004 


Hs.72472 


BMP-R1B 


35 9 


359 


4 rt 

10 


OO 7 

29.7 


114yoo AA2510o9 




gb:zs04f05.s1 NCI_CGAP_GCB1 Homo sapiens 1 1 5 


115 


1 


6 9 


115206 AW 183695 


Hs 1 86572 


ESTs 


58 


58 


1 


5 


115719 AW992405 


Hs 59622 


Homo sapiens, clone 1MAGE:3507281, mRNA, 


76 


144 


19 


13.9 


115644 AI373062 


Hs.332938 


hypothetical protein MGC5370 


62 • 


62 


1 


5 4 


116470 AI272141 


l 1 _ fit AO A 

Hs. 83484 


SFIY (sex determining region Y)-box 4 


1 8 


1047 


596 


16 


116786 H25836 


Hs. 301527 


ESTs, Moderately similar to unknown [H.s 


228 


228 


9 


12.4 


117280 M18217 


Hs 172129 


Homo sapiens cDNA: FU21409 fis, clone C 


39 


322 


83 


4 4 


117412 N32536 


Hs42645 


solute carrier family 1 6 {monocarboxyfic 


17.4 


174 


9 


69 


118472 AL1 57545 


Hs 42179 


bromodomain and PHD finger containing, 3 


14.5 


145 


1 


24 


4 4PlO*TH A JAP 4\ 4 4 ft 

119271 AI061118 


Hs.65328 


Fanconi anemia, complementation group F 


82 


82 


1 


64 


119771 A 1905687 


Hs 2533 


EST 


35 


2073 


595 


2 1 


120562 BE244580 


Hs 302267 


hypothetical protein FLJ10330 


85 


127 


15 


1 6 


121463 AKD00282 


Hs 239681 


hypothetical protein FU20275 


103 


103 


1 


9 3 


121723 AA243499 


Hs. 104800 


hypothetical protein FU 10134 


29 


214 


74 


37 


122963 AA478446 


Hs 69559 


KIAA1 096 protein 


72 


72 


1 


57 


123137 AI073913 


Hs 100686 


ESTs, Weakly similar to JE0350 Anterior 


9.9 


351 


36 


13.9 


123619 AA602964 




gb:no97c02.s1 NCI CGAP Pr2 Homo sapiens 


85 


85 


1 


43 


123709 AA706910 


Hs 112742 


ESTs 


39 


60 


16 


4 8 


124006 AI147155 


Hs270016 


ESTs 


58 


321 


55 


17 


124059 BE387335 


Hs.283713 


ESTs, Weakly similar to S64054 hypotheti 


104 


880 


85 


53 


124308 AA249027 


Hs241507 


ribosomal protein S6 


105 


105 


1 


99 


125279 AW401809 


Hs.4779 


KIAA1 150 protein 


131 


131 


1 


51 


125617 AA287921 


Hs. 164950 


ESTs 


67 


67 


1 


6 


127439 D60237 


Hs 14368 


SH3 domain binding glutamic acid-rich pr 


30.6 


306 


4 


26.5 


128305 AI954968 


Hs.279009 


matrix Gla protein 


75 


75 


1 


6 5 


128482 AI694U3 


Hs.296251 


programmed cell death 4 


72 


72 


1 


58 


128790 AF026692 


Hs.105700 


secreted frizzJed-related protein 4 


17.4 


409 


24 


7 8 


128925 R67419 


Hs 21851 


Homo sapiens cDNA FU12900 fis, clone NT 


7.1 


392 


56 


36 


129017 AA1 15333 


Hs 107968 


ESTs 


82 


82 


1 


74 


129229 AF013758 


Hs 109643 


polyadenylate binding protein -interactin 


71 


71 


1 


62 


129337 NM_014918Hs 110488 


KIAA0990 protein 


95 


95 


1 


8 5 


129366 BE220806 


Hs 184697 


Homo sapiens clone 23785 mRNA sequence 


71 


150 


21 


14,5 


129821 AB028945 


Hs 12696 


cortactin SH3 domain-binding protein 


11.4 


114 


1 


10 


130036 BE061916 


Hs 125849 


chromosome 8 open reading frame 2 


67 


67 


1 


57 


130057 AF027153 


Hs324787 


solute earner family 5 {inositol transp 


1 


1 


1 


1 


130095 AK001635 


Hs 14838 


hypothetical protein FU10773 


146 


219 


15 


7.6 


130343 AB040914 


Hs278628 


KIAA1481 protein 


132 


331 


25 


12.4 


130385 AW067800 Hs 155223 


stanniocalcin 2 


72 2 


790 


l 


i y 


130407 BE385099 


Hs.334727 


hypothetical protein MGC3017 


65 


65 


4 


53 


130441 U63630 


Hs. 155637 


protein kinase, DNA-activated, catalytic 


61 


61 


1 


5.7 


130455 D90041 


Hs.155956 


N-acetyltransferase 1 (arylamine N-acety 


10.8 


706 


66 


92 


130604 AA383256 


Hs.1657 


estrogen receptor 1 


32.2 


322 


1 


47 


130617 M90516 


Hs 1674 


glutamine-fructose-6-phosphate transamin 


10 


100 


1 


7.6 


130712 AJ271881 


Hs 279762 


bromodornain-containing 7 


17.5 


175 


2 


12.8 


131148 AW953575 Hs.303125 


p53-induced protein PIGPC1 


3.8 


585 


153 


37 


131388 NM_014S10Hs 92200 


KIAA0480 gene product 


7.6 


76 


1 


5 


131564 T93500 


Hs.28792 


Homo sapiens cDNA FU11041 fis, clone PL 


47 


381 


81 


6.4 


131742 AA961420 


Hs.31433 


ESTs 


117 


117 


1 


10.1 


131877 J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD) 


68 


68 


1 


5.6 


131985 AA503020 


Hs 36563 


hypothetical protein FLJ22418 


40.2 


402 


1 


4 


132316 U28831 


Hs 44566 


KIAA1641 protein 


18.6 


186 


10 


1.5 


132528 T78736 


Hs.50758 


SMC4 {structural maintenance of chromoso 


93 


93 


1 


8.4 


132742 AA025430 


Hs.292812 


ESTs, Weakiy similar to T33468 hypotheti 


6.5 


65 


1 


5.6 


132990 X77343 


Hs 334334 


transcription factor AP-2 alpha (activat 


12.7 


311 


25 


2.4 


133015 AJ002744 


Hs.246315 


U DP-N -acetyl-alpha- D-gaiariosarnJne:pofyp 


4.6 


427 


93 


10.4 
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133199 AF231981 


Hs.250175 


homolog of yeast long chain pofyunsatura 


3 


816 


275 


3.9 


133240 AKQ01489 


Hs.242894 


ADP-ribosylation factor-like 1 
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81 
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4.6 


133271 Z48633 


Hs.283742 


H.sapiens mRNA for retrotransposon 


12.4 


124 
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10.8 


133640 AW246428 Hs.75355 


ubiquitin-conjugating enzyme E2N (homolo 
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133746 AW410035 Hs.75862 


MAD (mothers against decapentapieglc, Dr 
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93 
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133999 AA535244 


Hs.78305 


RAB2, member RAS oncogene family 


7.8 


78 




5.6 


134110 U41060 


Hs79136 


L1V-1 protein, estrogen regulated 


4.5 


1472 330 


2.1 


134485 X82153 


Hs.83942 


cathepsin K (pycnodysostosis) 


34.3 


411 


12 


5.1 


134654 AK001741 


Hs.8739 


hypothetical protein FU 10879 


6.4 


64 




5.1 


134880 A1879195 


Hs.90606 


15 kDa selenoprotein 


5.7 


57 




5 


135029 H58818 


Hs. 187579 


hydroxysterotd (17-beta) dehydrogenase 7 


11.5 


115 




10 


135389 U05237 


Hs.99872 


fetal Alzheimer antigen 


20.6 


206 


4 


19.1 


128305 AI954968 


Hs.279009 


matrix Gla protein 


9.4 


94 


3 


5.3 
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TABLE 11 A 

Table 11 A shows the accession numbers for those pkeys lacking unigenelD's for Table 11. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession; Genbank accession numbers 



Pkey CAT number Accession 

123619 371681.1 AA602964 AA60S200 

1 1 3702 genbank JT973G7 T97307 

114988 genbankJVA251089 AA251089 
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TABLE 12: Figure 12 from BRCA 001-3 PCX 



Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R 1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90" 1 percentile tumor to body 

R3: Ratio of 75 th percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 



Pkey 


ExAccn UnigenelD Unigene Title 


R1 


R2 


R3 


R4 


100131 


D12485 Hs 11951 phosphodiesterase I (PC-1 ) 


13.2 


244 


19 


9.9 


105500 


AW602166 Hs 222399 ESTs 


25.4 


508 


20 


3 


112244 


AB029000 Hs. 70823 KIAA1 077 protein 


5.7 


567 


100 


6.7 


114124 


W57554 Hs 125019 ESTs 


24.2 


242 


10 


5.6 


119771 


AI905687 Hs.2533 ESTs 


3.5 


2073 


595 


2.1 


121723 


AA243499 Hs. 104800 ESTs 


2.9 


214 


74 


3.7 


128790 


AF026692 Hs 105700 secreted frizz led-re la ted protein 4 


17.4 


409 


24 


7.8 


131148 


AW953575 Hs.303125 ESTs 


3.8 


585 


153 


3.7 


131985 


AA503020 Hs 36563 ESTs 


40.2 


402 


1 


4 


133199 


AF231981 Hs,250175 Homo sapiens clone 23904 mRNA sequence 


3 


816 


275 


3.9 
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TABLE 13: Table 1 from BRCA 001-5 US 
Table 13 depicts a preferred group of genes upregulated in breast cancer cells. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 
Unigene Titfe: Unigene gene title ■ 

R1 ; Ratio of tumor to normal body tissue 

15 

Pkey ExAccn UnlGenelD Unigene Title 

100038 M97935 control 

100039 M97935 control 
20 100040 M97935 control 

100041 M97935 control 

100082 AB003103 Hs,4295 proteasome (prosome; macropain) 26S sub 

100091 AF000177 Hs 111783 Lsm1 protein 

100100 AF006084 Hs 11538 actin related protein 2/3 complex; subunit 

25 100103 AF007875 Hs5085 do Itchy I- phosphate man nosyltr an sf erase p 

100114 D00596 Hs. 82962 thymidy late synthetase 

100121 D10495 Hs 155342 protein kinase C; delta 

100123 D10523 Hs 168669 oxog luta rate dehydrogenase {lipoamide) 

100128 D11094 Hs61153 proteasome (prosome; macropain) 26S sub 

30 100131 D12485 Hs 11951 phosphod 'teste rase l/nucleotide pyrophosp 

100137 D13627 Hs.15071 chaperonin containing TCP1 ; subunit 8 (t 

100144 D13643 Hs.75616 Human mRNA for K1AA0O18 gene; comp 

100147 D13666 Hs 136348 osteoblast specific factor 2 {fasciclin I -like 

100154 D14657 Hs.81892 KIAA01 01 gene product 

35 100164 D14812 Hs.173714 MORF-related gene X 

100169 D14878 Hs.82043 D123 gene product 

100190 D21090 Hs 178658 RAD23 (S. cerevisiae) homofog B 

100203 D25538 Hs 172199 adenylate cyclase 7 

100209 D26308 Hs 76289 biliverdin reductase B (flavin reductase (N 

40 100215 D26598 Hs.82793 proteasome (prosome; macropain) subunrt 

100216 D26599 Hs 1390 proteasome (prosome; macropain) subunit 

100219 D28137 Hs.118110 bone marrow stromal cell antigen 2 

100227 D28915 Hs.82316 interferon-induced; hepatitis C-associated 

100248 D31888 Hs.78398 KIM0071 prote'm 

45 100287 D43950 Hs.1600 chaperonin containing TCP1; subunit 5 (e 

100294 D49396 Hs 75454 antioxidant protein 1 

100307 D50525 Hs699 hypothetical prote'm 

100335 D63391 Hs.6793 platelet-activating factor acetylhydrotase; 

100340 D63487 Hs82563 KIM0153 protein 

50 100355 D78129 Hs 71465 Homo sapiens mRNA for squaiene epoxid 

100363 D78514 Hs78563 ubiquitin-conjugating enzyme E2G 1 (horn 

100368 D79987 Hs.153479 extra spindle poles; S, cerevisiae; homoto 

100372 D79997 Hs 184339 K1AA01 75 gene product 

100375 D80004 Hs 75909 KIAA01 82 protein 

55 100379 D82060 Hs 278721 Ke4 gene; mouse; human nomolog of 

100387 D83777 Hs75137 KIAA01 93 gene product 

100393 D34145 Hs 39913 novel R GO -containing protein 

100398 D84557 Hs.155462 minich romoso me maintenance deficient (m 

100405 D86425 Hs.82733 nidogen2 

60 100406 D86479 Hs.1 18397 AE-binding protein 1 

100409 D86957 Hs.80712 KIAA0202 protein 

100421 D86985 Hs.79276 Human mRNA for KJAA0232 gene; comp 

100446 D87464 Hs.10037 KIAA0274 gene product 

100447 D87465 Hs74583 KIAA0275 gene product 
65 100448 D87469 Hs.57652 EGF-iike-domain; multiple 2 

215 



R1 

16.7 

6.3 

83 

14.8 

75 

49 

47 

13.4 

15.9 

46 

75 

44 

87 

95 

6 

85 

10.5 

46 

7.9 

5.6 

9.9 

4.9 

14.2 

11.3 

5.7 

5.7 

74 

5.6 

12.9 

84 

6.8 

44 

12.6 

46 

6.5 

84 

45 

81 

10.7 

72 

72 

54 

43 

11.9 

97 

6 4 

10 

6.2 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



100467 


D89052 


Hs.7476 


ATPase; H+ transporting; lysosomal (vacu 


7.5 


100468 


D89289 


Hs.1 18722 


fucosyttransferase 8 (alpha (1 ;6) fucosyltr 


5 


100486 


HT1112 


Hs.10842 


Ras-Like Protein Tc4 


16.9 


100497 


HT1400 


Hs.79137 


Carboxyl Methyltransferase. Aspartate, A 


5.6 


100618 


HT2710 


Hs.1 14599 


Collagen, Type Vllf, Alpha 1 


7.5 


100661 


HT3018 


Hs.1 32748 


Ribosomal Protein L39 Homolog 


4.4 


100667 


HT3127 


Hs.169610 


Epican. Alt Splice 1 1 


4.6 


100668 


HT3938 


Hs.1 69610 


Epican.Alt Spfce 12 


4.4 


100676 


HT3742 


Hs 287820 


Flbronectin, AH Spfice 1 


9 


100775 


HT26388 


Hs 89603 


Mucin 1, Epithelial, Aft Splice 9 


47 


100783 


HT4018 


Hs. 191356 


Basic Transcription Factor, 44 Kda Subun 


13.7 


100829 


HT4343 


Hs 278544 


Cytosolic Acetoacetyl-Coenzyme A Thlo 


10.6 


100830 


HT4344 


Hs.4756 


Rad2 


55 


100840 


HT4392 


Hs 183418 


Protein Kinase Pitslre, Alpha, Alt SpRce 


41 


100850 


HT417 


Hs297939 


Cathepsin B 


4 


100866 


HT4582 


Hs 75113 


Transcription Factor Ifia 


49 


100906 


HT5158 


Hs 5398 


Guanosine 5 -Monophosphate Synthase 


8 7 


100914 


HT511 


Hs 324178 


Ras Inhfcitor Inf 


7 2 


100Q16 


HT544 


He 73946 


Endothelial Cefl Growth Factor 1 


59 




HT884 


He 1 RDfiflfi 

■ lO 1 uuuuu 


Oncogene E6-Ap, Papillornavirus 


46 


100375 


102Q91 


He 76^06 


lymphocyte cytosolic protein 1 (L-plastin 


30.1 


lOOQfifl 

IVA/300 


lO^fiAQ 


He 764 AO 

110/ U1 V/VJ 


uhirtiiftifv/(kp 4 


83 


100QQR 


lOWIQ 
juoyvjj 


He 14R91 


interferon! gamrna-inducible protein 30 


69 


IUU333 


juoyo^ 


He A070R 
ns out uo 


Hfarihnraep /f\JAnH^NADPH\ /rvtrwhnnm 
uidL/i iui abc i|"ir>urwi vmjt i ij uui 1 1 


43 


mini 1 

IU I U 1 1 


JU*f+OU 


Me 1911 

nb. i z i i 


arid nhnenKsta^p ^ tartratp rp'iRtant 


59 


inmi7 

IU I U I 1 


I04W) 
JUH333 


Hs 821 


hin fvra n 


51 


mm^i 

I U I UO 1 


JU3U/ U 


no. I J I / OO 


mafriy mptallnnrrrfpinacp Q /nplatinaiP B' 

1 1 iCIll IA 1 IFCiClimL/IVAuH \f vwlW loot* u t 


37^ 


IU 


I0R94Q 


He 7Q411 
ns / 3*t I i 


rpnltratinn rvntpin A9 /^JkD^ 


6 1 


IU I uD4 


PiUZtUJ 


He 7?cni 
nb / 030 i 


Human UHC class II HLA-DO-beta mRN 


43 


101061 


f\UOQ t -J 


He 1R0R3? 
no i (jUkJji. 


glucose phosphate isomerase 


4.3 


10lflQ1 
IU IU3 1 


I nci ^9 

LUu IOZ 


He 14Q1RR 
nb I 13 j 


uriMAnp-rlpnprYlpnt aninn rhannpl 1 


74 


IU IU3f 


I OR7Q7 


He RQ414 


rhpmnkinp IC-X-C, mntifi* rprpntnr 4 /fu*5 


46 


1 U 1 1 


I 07K1*i 
LU7 0 IO 


He 1RQ9RR 
ns i oyzoo 


Human npi imnpntiflp Y rPf^pntnrYl /MPY 


18.3 


101 14 1 * 
IU I I^J 


L12723 


He QOOQ? 
nb. 3vU3J 


hoat ehnH( 70kH runtpin 4 


17.4 


mi m9 

IU I IDZ 


1 1 7 RAO 
L I OOUU 


He QRA4 


Hnmn epnipnQ Hvpr PXnnp^cpH nrntpin APR 


76 


mi ift^ 


I 1Q77Q 


He 7QC 


H9A hfclnno famlfir mpinhpr O 
nzA iiibiuiic idiiiny, tntynucj w 


10.9 


101216 


L25876 


He R411? 


cyclin-dependent kinase inhibitor 3 (CDK 


74 


I U I zoo 


LZ3UU0 


He R7R 

ns of 0 


onrhitnl HohuHrrvion^iep 
out uiuji uci lyuiuy ci icioc 


14.6 


1 n 1 947 
IU IZ^+f 


] ^R01 
LOOOU I 


He 7RR09 
nb. / OOUZ 


nh/p/Yipn evntKpep klnaep "1 hpta 


7 5 


mi 7ft? 

I U I ZOZ 


l ^RR10 

LOOO IU 


He 7Q?A7 
nb / 300/ 


nrntpaen/np /nrrfirmnp* nnar:ronain^ 26S *;uh 


4 4 




L42572 


He 7A504 

1 lb . / Vrr 


inner membrane protein,* mitochondrial (m 


58 


IU IOOZ 


L47276 


He 1SR^4R 
nb. ijooho 


Hnmn eanipne (rtAl iinp HI -fi*i alnha tnnn 

l 1 Ui 1 IU OOUId 10 \ LCI 1 III 1 } l—^J J Uipi lu IW^V^ 


18.9 


iniiAA 

lUTO^O 


I 7791? 
Lf / Z 10 


He ?0QR4 

nb 0U33H 


nhnenhnmpvalnnsfp kfnfl^P 


75 


101352 


L77701 


He 169Q7 


C0X17 (yeast) homolog; cytochrome c ox 


93 


1U1-5/0 


IVI I Or 30 


He R?? 

ns ojo 


wtrellclLHI-oLtlliUidlcU ^lUlcill, u MJa 


18.1 




IVI 1 0/30 


He 7RQQR 

ns / oyyo 


nmlifpraHno rptl niirlpar antinpn 


86 


101404 


IVI 1 004Z 


He 1R9447 
no lOZt**/ 


hptprnnpnpmie niiHftar rihonudpoorotpin C 


4.5 


iniA*w 

IU 1 403 


M 900/19 
ivizuyuz 


He 9RRR71 
nb.zoou/ 1 


arwilinnnfYitoin (Lf 

□ UUH^/Upi UlCII 1 V^*l 


6.1 


101 4R4 


M9?S^R 

IVI^ZJOO 


He S19QQ 
nb O I Z33 


WADH rlphudmnpna^p /iihinuinonel flavn 


87 


101469 


M22877 


Hs 169248 


Human somatic cytochrome c (HCS) gene 


42 


101472 


M22960 


Hs 118126 


protective protein for beta-galactosidase ( 


6.5 


101478 


M23379 


Hs 758 


RAS p21 protein activator (GTPase activa 


14 


101484 


M24594 


Hs 20315 


interferon-induced protein 56 


9 2 


101539 


M30818 


Hs926 


myxovirus (influenza) resistance 2; homol 


51 


101540 


M30938 


Hs 84981 


X-ray repair complementing defective rep 


47 


101544 


M31169 




Human propionyi-CoA carboxylase beta-s 


5.5 


101552 


M31642 


Hs 82314 


hypoxanthine phosphoribosyttransferase 1 


85 


101580 


M34677 


Hs 83363 


DNA segment on chromosome X (unique) 


4.5 


101600 


1^37583 


Hs 119192 


H2A histone family; member Z 


5.7 


101563 


M60750 


Hs.2178 


H2B histone tamify; member A 


58 


101664 


M60752 


Hs 121017 


H2A histone famity; member A 


13.5 


101667 


M60858 


Hs 79110 


nucleoiin 


4 


101684 


M63256 


Hs 75124 


cerebellar degeneration- related protein (62 


7.6 


101702 


M64929 


Hs 179574 


protein phosphatase 2 (formerly 2A); regu 


4.2 


101754 


M77142 


Hs 239489 


TIA1 cytotoxic granule-associated RNA-b 


4.5 


101758 


M77836 


Hs 79217 


pyrroline-5-carboxylate reductase 1 


5.7 


101767 


M81057 


Hs.1 80884 


carboxypeptidaseBI (tissue) 


21.7 



216 



WO (12/059377 



PCT/US02/02242 



101770 M81501 Hs.78859 transcription elongation factor A (Sll); 1 4.6 

101791 M&3822 Hs.62354 cell division cycle 4-fike 9-7 

101803 M86546 Hs.155691 pre-B-cell leukemia transcription (actor 1 5.5 

101809 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 22 5 

5 101839 M93036 Hs.692 membrane component; chromosomal 4; su 4 

101851 M94250 Hs.82045 midkine (neurite growth-promoting facta 7.6 

101888 M99701 Hs-95243 transcription elongation factor A (Sliyike 114 

101973 S82597 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:po 4.6 

101991 U00968 Hs.166 Human SREBP-1 mRNA; complete cds 4.1 

10 102009 U02680 Hs.82643 protein tyrosine kinase 9 4.4 

102025 U03911 Hs.78934 mutS (E. coli) homoiog 2 (colon cancer; n 4 

102047 U07158 Hs.83734 syntaxin 4 A (placental) 6.1 

102051 U07550 Hs.1197 heat shock 10kD protein 1 (chaperonir 10 4.4 

102083 U10323 Hs75117 interteukin enhancer binding factor 2; 45k 10 4 

15 102095 U11313 Hs.75760 sterol carrier protein 2 9-5 

102130 U15009 Hs.1575 small nuclear ribonucieoprotein D3 polyp 6.6 

102133 U15173 Hs.155596 BCL2/adenovirus E1B 19kD-interacting p 4.3 

102148 U16954 Hs.75823 ALL 1- fused gene from chromosome 1q 6.9 

102179 U19713 Hs.76364 allograft inflammatory factor 1 4.8 

20 102180 U19718 Hs.33551 microftbriflar-associated protein 2 7.2 

102193 U20758 Hs.313 seaeted phosphoprotein 1 (osteopontin; b 7.2 

102198 U21090 Hs. 74598 polymerase pNA directed); delta 2; regu 4.3 

102202 U21931 Hs.574 fructose-bis phosphatase 1 4.5 

102209 U22970 Hs.265827 interferon; alpha-inducible protein (clone 9.9 

25 102211 U23070 Hs.78776 putative transmembrane protein 4.9 

102220 U24389 Hs.65436 lysyf oxidase-like 1 8 5 

102224 U24704 Hs.148495 proteasome (prosome; macropain) 26S sub 5.4 

102234 U26312 Hs.278554 chromobox homoiog 3 (Drosophila HP1 g 7 7 

102250 U28014 Hs.74122 caspase 4; apoptosis-related cysteine prot 5.4 

30 102260 U28386 Hs.159557 karyopherin alpha 2 (RAG cohort 1 ; impo 6.3 

102261 U28488 Hs.155935 complement component 3a receptor 1 57 

102273 U30888 Hs.75981 ubiqurtin specific protease 14 (tRN A -guan 6.1 

102298 U 32849 Hs.54483 N-myc (and STAT) interactor 4.1 

102302 U33052 Hs,69171 protein kinase C-iike 2 4.3 

35 102305 U33286 Hs.90073 chromosome segregation 1 (yeast homok) 5 4 

102320 U34683 Hs.82327 glutathione synthetase 4.1 

102330 U35451 Hs.77254 chromobox homoiog 1 (Drosophila HP1 b 4 

102348 U37519 Hs.87539 aldehyde dehydrogenase 8 9.4 

102361 U39400 Hs.75859 chromosome 11 open reading frame 4 5.2 

40 102362 U39412 Hs.75932 N-ethylmaieimide-sensitive factor attachm 9.3 

102369 U 39840 Hs.299867 hepatocyte nuclear factor 3; alpha 7 7 

102395 U41767 Hs.92208 a dlsintegrin and metalloproteinase domai 10.4 

102409 U43286 Ha 118725 setenophosphate synthetase 2 6 2 

102418 U43923 Hs.79058 suppressor of Ty (S.cerevisiae) 4 homoiog 4 1 

45 102425 U44772 Hs.3873 palmrtoy {-protein thioeste rase (ceroid- lipo 4 8 

102457 U48807 Hs.2359 dual specificity phosphatase 4 6.3 

102465 U49352 Hs.81548 2;4-dienoyl CoA reductase 1; mftochondri 9.4 

102495 U51240 Hs79356 Lysosomal -associated multispanning mem 6.5 

102534 U56833 Hs.198307 von HippeHindau binding protein 1 8 6 

50 102546 U57877 Hs.3577 succin ate dehydrogenase complex; subuni 4 3 

102549 U58046 Hs.198899 eukaryotic translation initiation factor 3; s 6,3 

102557 U58766 Hs.264428 tissue specific transplantation antigen P35 5 

102562 U 59309 Hs.75653 fumarate hydratase 6 

102568 U59877 Hs.223025 RAB31; member RAS oncogene family 91 

55 102580 U60806 Hs.152981 CDP-diacylglycerol synthase (phosphatid 7 9 

102581 U61145 Hs.77256 enhancer of zeste (Drosophila) homoiog 2 7 6 

102590 U62136 Hs.79300 Homo sapiens enterocyte differentiation a 7 

102591 U62325 Hs.324125 amyloid beta (A4) precursor protein -bindi 4 

102592 U62389 Hs.11223 Human putative cytosolic NADP-<Jepende 5 
60 102617 U65928 Hs.198767 Jun activation domain binding protein 6 1 

102618 U65932 Hs.81071 extracellular matrix protein 1 23.2 

102638 U67319 Hs.9216 caspase 7; apoptosis-related cysteine prot 8 9 

102663 U70322 Hs.168075 karyopherin (importin) beta 2 7.1 

102666 U70660 Hs.279910 ATX1 (antioxidant protein 1; yeast) homo 4 7 

65 102679 U72661 Hs.11342 ninjurin 1; nerve injury- induced protein -1 4.7 

102687 U73379 Hs.93002 ubiqurtin carrier protein E2-C 7.7 
102704 U76638 Hs.54089 BRCA1 associated RING domain 1 5.6 
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103464 Y00285 Hs 76473 insulin-l:ke growth factor 2 receptor 

103470 Y00796 Hs.174103 integrin; alpha L (antigen CD11 A (p1 80); 

103494 Y06991 Hs,83050 phosphafidylinositol 3-kinase-associated p 

103505 Y09912 Hs.33102 transcription factor AP-2 beta (activating 

5 103547 214982 Hs.180062 proteasome (prosome; macropain) submit 

103551 215115 Hs.75248 topoisomerase (DNA) I! beta (180kD) 

103565 Z2254B Hs. 146354 thioredoxr-dependent peroxide reductase 

103587 Z29083 Hs,82128 5T4 oncofetal trophobtast glycoprotein 

103621 Z47727 Hs.150675 polymerase (RNA) II (DNA directed) pol 

10 103622 248042 Hs.278672 membrane component; chromosome 11; s 

103658 Z74615 Hs.172928 collagen; type I; alpha 1 

103680 293784 Homo sapiens DNA sequence from PAC 

103772 AA092473 Hs.278554 chromobox homoiog3(DrosophilaHP1 g 

103774 AA092898 Hs.92918 ESTs; Weakly similar to R07G3.8 [C.eleg 

15 103821 AA 157623 Hs.1 98793 KIAA0750 gene product 

103835 AA172215 Hs.93748 ESTs; Moderately similar to TRANSCRIP 

103886 AA236384 Hs.105737 ESTs; Weakly similar to gene 9306 protei 

103890 AA236843 Hs.72085 ESTs; Weakly similar to unknown [S.cere 

103892 AA243523 Hs.239189 ESTs 

20 104054 AA393432 Hs.7100 hypothetical protein 

104115 AA428090 Hs.26102 ESTs 

104136 AA442669 Hs.268371 zv6df6.r1 Soares_total_fetus_Nb2HF8_9w 

104147 AA451992 Hs.283037 ESTs; Highly similar to HSPC039 protein 

104173 AA476564 Hs.76561 ESTs; WeaMy similar to finger protein HZ 

25 104181 AA479521 Hs.283740 ESTs 

104183 AA480838 Hs.1 14309 ESTs 

104192 AA486946 Hs.21321 Homo sapiens mRN A; cDN A DKFZp564 

104209 AB000221 Hs. 16530 small inducible cytokine subfamily A (Cy 

104234 AB002357 Hs.168212 kinesin family member 3B 

30 104271 C01687 Hs.7381 ATP synthase; H+ transporting; mttochon 

104278 C02582 Hs.109253 ESTs; Highly similar to N-terminaJ acetyl 

104307 D52818 Hs.1 11680 endosulfine alpha 

104309 D55869 Hs.284123 Homo sapiens mRNA full length insert cD 

104370 H19378 Hs.21851 Homo sapiens mRNA; cDNA DKFZp 586 

35 104446 L44497 Hs.7351 ESTs 

104453 M19169 Hs.123114 cystatinSN 

104476 N33807 Hs.324275 protease; serine; 15 

104558 R56678 Hs.88959 Human DNA sequence from done 967N2 

104592 R81003 Hs.325820 serine protease; umbilical endothelium 

40 104634 AA004274 Hs.19151 ESTs 

104636 AA004415 Hs.106106 ESTs 

104658 AA007145 Hs.27268 Homo sapiens mRNA; cDNA DKFZp564 

104667 AA007234 Hs.30098 ESTs 

104675 AA009596 Hs.301553 ESTs; Moderately similar to II 1 1 ALU SU 

45 104767 AA025534 Hs.8852 ESTs 

104785 AA027163 Hs.7942 ESTs 

104791 AA029046 Hs.301871 ESTs; Moderately similar to cAMP indue 

104804 AA031357 Hs.31803 ESTs; Weakty similar to N-WASP [H.sap 

104807 AA032147 Hs.23296 ESTs 

50 104837 AA039469 Hs.21126 ESTs; Weakly similar to KIAA0299 [H.s 

104849 AA040270 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 

104867 AA045481 Hs.225979 Human gene from PACs37M1 7 and 305B 

104884 AA053021 Hs.14511 SCO (cytochrome oxidase deficient; yeast 

104906 AA055809 Hs.26802 ESTs; Weakly similar to ptosphoprotein [ 

55 104919 AA057193 Hs,25252 ESTs 

104921 AA057839 Hs,1508 ESTs 

104926 AA058846 Hs.33363 DKFZP434N093 protein 

104938 AA064627 Hs.318725 ESTs; Highly similar to CGI-72 protein [H 

104943 AA065217 Hs.114218 ESTs 

60 104957 AA074919 Hs.10026 ESTs; Weakly simitar to ORF Y J L063c [S 

104961 AA076672 Hs.33905 ESTs 

104968 AA084602 Hs.29669 ESTs 

104975 AA086071 Hs.50758 chromosome-associated polypeptide C 

104977 AA088228 Hs.18272 ESTs 

65 104978 AA088458 Hs.1 9322 ESTs 

104987 AA101723 Hs.1 1861 ESTs 

105002 AA113266 Hs.182704 ESTs; Moderately similar to alternatively 
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105012 AA 116036 Hs.9329 chromosome 20 open reading frame 1 10.7 

105019 AA121879 Hs.9280 proteasome (prosome; macropain) subunrt 5.7 

105029 AA1 26855 Hs. 13268 ESTs 4.4 

105033 AA127964 Hs.274329 TP 53 target gene 1 6.3 

5 105035 AA128486 Hs.8859 ESTs 6.5 

105039 AA130349 Hs.36475 ESTs 4 

105062 AA1 34968 Hs.36529 ESTs 4.3 

105076 AA142858 Hs.37810 ESTs 6.4 

105087 AA147884 Hs.9812 ESTs 9.2 

10 105091 AA148859 Hs.179909 ESTs; Weakly similar to !!!! ALU SUBFA 5.7 

105093 AA149051 Hs.32405 ESTs 6.3 

105107 AA1 52302 Hs.25035 DKFZP566G223 protein 6.2 

105127 AA158132 Hs.301957 ESTs; Weakly similar to contains simiiari 5.7 

105132 AA159501 Hs.247280 HBV associated factor 4.2 

15 105143 AA165333 Hs.24808 ESTs 4.7 

105154 AA171736 Hs.35947 methyl-CpG bind ing domain protein 4 9 

105162 AA176690 Hs.4084 KIAA1025 protein 9.1 

105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp5 64 19.3 

105209 AA205072 Hs.227743 KIAA0980 protein 7.4 

20 105223 AA211388 Hs.7750 ESTs 5.1 

105252 AA227428 Hs.9728 ESTs; Weakly similar to KIAA051 2 prote 11.1 

105253 AA227448 Hs,5003 KIAA0456 protein 6.4 
105261 AA227871 Hs 6361 MEK partner 1 9.1 
105263 AA227926 Hs6682 ESTs 6 7 

25 105274 AA228122 Hs.281866 ATPase; H+ transporting; lysosomal (vacu 5 3 

105297 AA233451 Hs.183858 transcriptional intermediary factor 1 8 7 

105309 AA233790 Hs 4104 ESTs; Weakly similar to cON A EST yk38 7.4 

105312 AA233854 Hs 23348 S- phase kinase-associated protein 2 (p45) 5B 

105342 AA235286 Hs 157078 ESTs 4 5 

30 105376 AA236559 Hs8768 ESTs; Weakly similar to!!!! ALU SUBFA 5 8 

105386 AA236950 Hs8115 ESTs 5 5 

105397 AA242868 Hs7395 ESTs; Weakly similar to house-keeping p 7 7 

105399 AA243007 Hs 15420 ESTs; Highly similar to SH3 domain-bind 5 6 

105400 AA243052 Hs 65648 RNA binding motif protein 8 5 8 
35 105404 AA243303 Hs.21187 ESTs 91 

105409 AA243562 Hs 301855 ESTs 4 4 

105436 AA252172 Hs 237856 ESTs; Moderately similar to cAMP indue 51 

105483 AA255874 Hs 23458 ESTs 4 9 

105493 AA256268 Hs. 10283 ESTs 6 

40 105495 AA256317 Hs 28785 Homo sapiens mRNA; cDNA DKF2p586 5 2 

105496 AA256323 Hs 301997 DKFZP434N 126 protein 87 

105500 AA256485 Hs 222399 CGI-96 protein 9 5 

105507 AA25667B Hs.226318 ESTs; Moderately similar to CCR4-associ 4 1 

105538 AA258860 Hs.32597 ring finger protein (C3H2C3 type) 6 41 

45 105544 AA261954 Hs 24678 ESTs 8 

105546 AA262032 Hs.268281 ESTs; Weakly similar to 62D9.a [D.melan 81 

105549 AA262417 Hs5415 ESTs 4 6 

105551 AA262477 Hs 25292 ribonuclease HI; large subunit 91 

105560 AA262783 Hs 306915 ESTs 4 5 

50 105565 AA278302 Hs 18349 ESTs; Weakly similar to partial CDS [C.e 4 2 

105566 AA278323 Hs 17481 Homo sapiens clone 24606 mRNA sequen 11.9 

105575 AA278717 Hs 12772 ESTs 5 9 

105584 AA279012 Hs 3454 ESTs; Weakly similar to KIAA0665 prote 4 4 

105596 AA279418 Hs 18490 ESTs 4 

55 105604 AA279787 Hs 15467 ESTs; Moderately similar to putative pho 5 6 

105610 AA279991 Hs.99872 ESTs; Weakly simitar to trithorax homolo 5 3 

105621 AA280865 Hs.6375 Homo sapiens mRNA; cDNADKFZp564 4 8 

105627 AA281245 Hs.23317 ESTs 7.5 

105638 AA281599 Hs .247817 Homo sapiens mRNA for for h'etone H2B 5.9 

60 105645 AA282138 Hs 11325 ESTs 6.4 

105650 AA282347 Hs .25635 ESTs; Highly similar to HSPC003 [H.sap 11.3 

105666 AA283930 Hs 34906 ESTs 4.7 

105674 AA284755 Hs 279789 CDW52 antigen (CAMPATH-1 antigen) 8 

105687 AA286809 Hs.28423 ESTs 7.1 

65 105700 AA287643 Hs.35254 ESTs; Weakly similar to hypothetical pro 4.9 

105705 AA290767 Hs.101282 Homo sapiens mRNA; cDNA DKF2p4 34 8 

105709 AA291268 Hs.26761 DKFZP586L0724 protein 6.8 
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105731 AA292711 Hs.29131 ESTs 6.4 

105753 AA299789 Hs 110857 ESTs 7 

105774 AA348014 Hs 23412 ESTs 7 1 

105784 AA350771 Hs 17850 ESTs 13.4 

5 105791 AA358038 Hs 14368 SH3-binding domain glutamic acid-rich p 4 3 

105807 AA393803 Hs 16869 ESTs; Moderately similar to COLLAGEN 5 3 

105808 AA393808 Hs 286131 KIAA0438 gene product 41 

105812 AA394126 Hs 20814 ESTs; Highly similar to CGI-27 protein [H 14.6 

105813 AA394140 Hs 18585 ESTs 4 9 
10 105819 AA397920 Hs 28783 Homo sapiens mRNA; cDNA DKFZp564 4 9 

105870 AA399623 Hs 101067 ESTs 4 8 

105874 AA400074 Hs 171118 ESTs 4 

105896 AA400999 Hs 7838 Human ring zinc-finger protein (ZNF127- 4 8 

105934 AA404248 Hs. 16577 ESTs 5 2 

15 105935 AA404277 Hs.263727 ESTs; Weakly similar to bisphosphate 3'- 4 

105966 AA406105 Hs5344 adaptor- re late d protein complex 1; gamma 8 3 

105974 AA406321 Hs6224 KIAA0895 protein 4 6 

105990 AA410336 Hs 29403 ESTs; Weakly similar to PROBABLE AT 4 5 

105995 AA410510 Hs5345 ESTs 4 9 

20 106000 AA410972 Hs 20726 ESTs 53 

106007 AA411462 Hs 11042 ESTs. Weakly similar to veli 1 [H. sapiens 6.9 

106016 AA411819 Hs8164 KIAA0898 protein 5 

106034 AA412473 Hs 14928 ESTs 6 6 

106042 AA412700 Hs.169895 ubiqu.tin-conjugating enzyme E2L 6 4 6 

25 106057 AA417067 Hs.289074 ESTs 4 5 

106065 AA417558 Hs.25206 ESTs 12.3 

106070 AA417761 Hs 5957 Homo sapiens clone 24416 mRNA sequen 5 

106103 AA421104 Hs.12094 ESTs 15.4 

106126 AA424O06 Hs 22972 ESTs. Moderately similar to H5AR [M.m 6 4 

30 106154 AA425304 Hs.6994 ESTs 51 

106157 AA425367 Hs 34892 ESTs 11.1 

106166 AA425872 Hs 19561 NADH dehydrogenase (ubiquinone) 1 alp 19.3 

106204 M428024 Hs 21479 ESTs 4 7 

106210 AA428239 Hs 10338 ESTs 5 7 

35 106220 AA428582 Hs 32196 ESTs. Moderately similar to metarg "din p 7 7 

106236 AA429951 Hs 21104 ESTs 8 

106240 AA430074 Hs 18552 ESTs. Weakly similar to Ylr218cp [S.cere 4 4 

106263 AA431462 Hs 28329 ESTs 4 9 

106288 AA435536 Hs.24336 ESTs 8 8 

40 106293 AA435591 Hs 301444 signal sequence receptor gamma (transloc 8 7 

106310 AA436244 Hs 17240 ESTs 4 5 

106317 AA436568 Hs 108124 ESTs 4 

106328 AA436705 Hs 28020 KIAA0766 gene product 4 4 

106341 AA441798 Hs.5243 ESTs, Moderately similar to plL2 hypoth 23.7 

45 106348 AA442253 Hs 10702 ESTs 4 7 

106350 AA442763 Hs 194698 cyciinB2 61 

106371 AA443923 Hs 170310 ESTs 6 8 

106389 AA446949 Hs6236 ESTs 4.7 

106394 AA447223 Hs 25320 Homo sapiens clone 25142 mRNA sequen 4 4 

50 106426 AA448282 Hs 15206 ESTs; Weakly similar to F55C12.5 [C.ele 4 5 

106459 AA449741 Hs4029 giioma-amprrfied sequence-41 4 8 

106462 AA449912 Hs.30532 ESTs; Highly similar to CGI -77 protein [H 5 2 

106468 AA450047 Hs 14770 ESTs 6.8 

106479 AA450351 Hs 75251 ESTs 12.4 

55 106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 4.5 

106503 AA452411 Hs. 29679 ESTs; Highly similar to mediator [H.sapie 5.1 

106507 AA452584 Hs 267819 protein phosphatase 1; regulatory (inhibto 4.9 

106533 AA453786 Hs.145998 ESTs 8.3 

106568 AA455970 Hs 28285 patched related protein translocated in re n 7.6 

60 106586 AA456598 Hs.57787 ESTs 8.2 

106589 AA456646 Hs 28661 ESTs 4.8 

106606 AA457730 Hs 283437 Homo sapiens clone 23851 mRNA sequen 4.4 

106611 AA458904 Hs 26267 ESTs; Weakly similar to to rsinA [H.sapie 7 

106614 AA458934 Hs 256150 ESTs 4.5 

65 106628 AA459657 Hs 12311 Homo sapiens done 23570 mRNA sequen 6.5 

106637 AA459961 Hs 250824 ESTs 5.5 

106644 AA460239 Hs 12680 ESTs 4.4 
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106664 AA460969 Hs.7510 mitogen -activated protein kinase kinase ki 8.4 

106698 AA463745 Hs.29403 ESTs; Weakly similar to PROBABLE AT 5.3 

106719 AA465171 Hs.236844 ESTs 5.6 

106726 AA465339 Hs.3886 ESTs 10,1 

5 106747 AA476473 Hs.171957 triple functional domain (PTPRF interact! 10.4 

106759 AA477263 Hs.25584 ESTs 4.2 

106765 AA477717 Hs.306117 interteukin 13 receptor; alpha 1 6.9 

106784 AA478558 Hs.227913 APIWifce 1 5.1 

106831 AA482014 Hs.29463 centrin; EF-hand protein; 3 (CDC31 yeast 5.1 

10 106836 AA482112 Hs.238707 ESTs 4.8 

106840 AA482548 Hs.5534 ESTs 10.3 

106856 AA486183 Hs,285123 ESTs; Weakly similar to similar to oxyste 6.2 

106865 AA487228 Hs.19479 ESTs 4.5 

106878 M488872 Hs.12314 Homo sapiens mRNA; cDNA DKFZpS 86 7.9 

15 106888 M489101 Hs.24734 oxyste rol binding protein 6.4 

106895 AA489665 Hs.25245 ESTs 4.6 

106909 AA490323 Hs 250747 SUMO-1 activating enzyme subunit 1 4.2 

106919 M490885 Hs 21766 ESTs 12.3 

106920 AA490899 Hs.296323 ESTs 6.2 
20 106941 M496204 Hs 237971 ESTs 4 

106942 M496347 Hs 31314 retinoblastoma-bindtng protein 7 4.8 

106948 AA496788 Hs 21077 KIAA0532 protein 4 

106968 AA504631 Hs 26813 ESTs; Weakly similar to hypothetical 43.2 4.4 

106973 AA505141 Hs 11923 Human DN A sequence from done 16 7A1 5.4 

25 106980 M521121 Hs8858 bromodomain adjacent to zinc finger do m 4,1 

106981 AA521157 Hs74101 ESTs 57 

106998 AA598461 Hs 195464 insulin-like growth factor binding protein 187 

107008 AA598710 Hs 23740 ESTs 6.2 

107028 AA599214 Hs 24143 ESTs 4.1 

30 107032 M599472 Hs 247309 succinate-CoA ligase; GDP-fomning; beta 5,3 

107052 AA600134 Hs. 12482 glyceronephosphateO-acyltransferase 4.8 

107053 M600147 Hs5741 ESTs; Weakly simitar to NADH-cytochro 5.8 
107056 AA600310 Hs 18720 programmed cell death 8 {apoptosis-in due 4.9 
107080 M609210 Hs 19221 ESTs 8.4 

35 107102 AA609723 Hs.30552 ESTs 8 

107109 AA609943 Hs 32793 ESTs 9.5 

107129 AA620553 Hs4756 flap structure-specific endonuclease 1 4.9 

107132 AA620598 Hs 9052 ESTs 5.3 

107136 AA620795 Hs.8207 ESTs 4 

40 107140 AA620889 Hs.170088 ESTs 6.7 

107151 AA621169 Hs 8687 ESTs 19 

107159 AA621340 Hs 10600 ESTs; Weakly similar to ORF YKR081 c [ 8.1 

107174 AA621714 Hs.25338 ESTs 8.5 

107217 D51095 Hs.35861 DKFZP586E1621 protein 7.2 

45 107252 D59971 Hs25925 ESTs 7.9 

107295 T34527 Hs.80120 UDP-N-acetyl-alpha-D^alactosamine:po 5.6 

107299 T40327 Hs.30661 lung resistance-related protein 8.4 

107324 T81665 Hs 278422 DKFZP58661 122 protein 7.5 

107372 U85625 Hs.8297 ribonuclease 6 precursor 4.7 

50 107373 U85773 Hs. 154695 phosphomannomutase 2 4 8 

107481 W58247 Hs.279766 Homo sapiens kinesin superfamily motor 6 3 

107531 Y13936 Hs.17883 protein phosphatase 1G (formerly 2 C); ma 83 

107859 AA024835 Hs.47584 potassium voltage-gated channel; delayed 7 3 

107890 M026030 Hs.61311 ESTs; Weakly similar to CALPA IN 2; LA 7.3 

55 107908 AA026894 Hs.42826 ESTs 49 

108039 AA041341 Hs.46670 ESTs 54 

108040 AA041551 Hs.159971 ESTs 8 4 
108102 AA046424 Hs.49433 ESTs; Weakly similar to HYPOTHETICA 66 
108217 AA058686 Hs.62588 ESTs 7 7 

60 108255 AA063157 Hs. 172608 ESTs 4 

108358 M071514 Hs.1634 ESTs 4 

108609 AA10O694 Hs.69499 Human DN A sequence from BAC 15E1 o 5 5 

108647 AA1 12396 Hs.44276 ESTs; Moderately similar to HOMEOBO 14.3 

108676 AA115562 Hs.274417 Homo sapiens mRNA; cDNA DKFZp564 5 2 

65 108687 AA120785 Hs.54347 ESTs 55 

108695 AA121315 Hs.70823 K1AA1077 protein 10.5 

108733 AA126422 zn84f1 ,s1 Stratagene lung carcinoma 9372 4.4 
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108774 AA128125 Hs7104C ESTs; Moderately similar to CELL GROW 4.6 

108828 AA131584 Hs 273344 DKFZP564O0463 protein 5.5 

108872 AA134063 Hs1 11680 ESTs 7.2 

108884 AA134958 Hs.293591 ESTs 11.3 

5 108893 AA135894 Hs 194691 retinoic acid induced 3 8.9 

109008 AA1 56360 Hs 87128 ESTs 14.7 

109010 AA156460 Hs.44229 dual specificity phosphatase 12 4.9 

109011 AA156542 Hs 72127 ESTs 4.6 
109042 AA159525 Hs 71779 Homo sapiens DNA from chromosome 19 7.2 

10 109086 AA166685 Hs.270737 tumor necrosis factor (ligand) superfamily 4 

109090 AA167006 Hs.70499 ESTs 5.9 

109101 AA167708 Hs. 52184 ESTs 4.2 

109112 AA169379 Hs.257924 ESTs 4 

109160 AA179387 Hs.301997 DKFZP434N1 26 protein 4 

15 109166 AA179845 Hs 73625 RAB6 interacting; kinesin-like (rabkinesin 13.6 

109178 AA181600 Hs.283707 ESTs 11.8 

109179 AA181902 Hs.192789 ESTs; Weakly similar to !!!! ALU SUBFA 5.4 
109261 AA1 95255 Hs.61779 ESTs 6.7 
109270 AA195515 Hs.3585 ESTs; Weakly similar to alternatively spii 4.9 

20 109277 AA196332 Hs.86043 ESTs 5.4 

109313 AA206800 Hs.86276 ESTs; Moderately similar to zinc finger p 5.5 

109415 AA227219 H&110826 trinudeotide repeat containing 9 20.1 

109454 AA232255 Hs.295232 ESTs 4.7 

109467 AA232904 Hs.63187 ESTs 6.8 

25 109481 AA233342 Hs.289069 ESTs, Weakly similar to WD40 protein C 10.6 

109508 AA233892 Hs.55902 ESTs, Weakly similar to .'(!' ALU SUBFA 8 

109514 AA234087 Hs.262346 ESTs; Weakly similar to ORF2: function 8.2 

109572 F02027 Hs.171937 ESTs 4 8 

109632 F04165 Hs.235873 ESTs; Weakly similar to K1 1C42 [C.eleg 5 2 

30 109644 F04477 Hs.291531 ESTs, Moderately similar to GLYCERAL 6 6 

109703 F09684 Hs.24792 ESTs; Weakly similar to ORF YOR283w 71 

109726 F10009 Hs.9196 ESTs 5 

109747 F10161 Hs.22969 ESTs 4 7 

109799 F10770 Hs.180378 Homo sapiens clone 669 unknown mRNA 45 

35 109814 F10979 Hs.153106 Homo sapiens clone 23728 mRNA sequen 8.7 

110189 H20543 Hs.6278 DKF2P586B 1621 protein 16.6 

110240 H25577 Hs. 176588 ESTs; Weakly similar to CYTOCHROME 6 2 

110280 H29285 Hs.32468 ESTs 4.5 

110520 H56965 Hs.4082 yi09f06.s1 Soa res fetai fiver spleen 1 NFL 5 7 

40 110561 H59617 Hs.5199 ESTs, Weakly similar to UBIQUITIN-CO 19.5 

110707 H95079 Hs.15617 ESTs; Weakly similar to !!J! ALU SUBFA 62 

110734 H98714 Hs.24131 ESTs 30.2 

110770 N22262 Hs.131705 ESTs 5.8 

110780 N23174 Hs.22891 sotute carrier family 7 (cationic amino aci 8 2 

45 110787 N24716 Hs.12244 ESTs; Weakly similar to C44B9.1 [C.eleg 6 7 

110794 N25262 Hs.27931 ESTs 5 9 

110799 N26101 Hs.323401 Human ring zinc-finger protein (ZNF127- 4 

110818 N29454 Hs.27552 ESTs, Weakly similar to putative pl50 [H 4 3 

110839 N30856 Hs.30246 solute carrier family 19 (thiamine transpo 12.8 

50 110844 N31952 Hs.167531 Homo sapiens mRNA full length insert cD 10.1 

110854 N32919 Hs.27931 ESTs 4 7 

110856 N33063 ESTs. Weakly similar to S 164 [H sapiens 4 2 

110860 N33438 Hs.170065 ESTs 12.5 

110897 N39148 Hs.688G DKFZP434D1 56 protein 4 

55 110915 N46252 Hs.29724 ESTs 222 

110935 N48787 Hs.305979 protease inhibitor 1 (anti-elastase); alpha- 4 8 

110970 N51374 Hs.96870 Homo sapiens mRNA full length insert cO 5 4 

111006 N53375 Hs.166146 Homer neuronal immediate early gene; 3 4 7 

111008 N53388 Hs.7222 ESTs 13 3 

60 111018 N54067 Hs.3628 mitoaen-activated protein kinase kinase ki 5 7 

111084 N59543 Hs.15456 PDZ do majn containing 1 83 

111100 N62522 Hs.20450 ESTs 14.3 

111125 N63823 Hs.269115 ESTs 79 

111132 N64378 Hs.83293 ESTs. Highly similar to unknown function 4.4 

65 111139 N64683 Hs.290943 ESTs 6 

111164 N66857 Hs.14808 ESTs, Weakly similar to I!!! ALU CLASS 41 

111172 N67102 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 5.5 
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Hs.171802 


ESTs; Weakly similar to hypothetical pro 


5.6 


111184 


N67437 


Hs.243901 


Homo sapiens mRNA; cDNA DKFZp564 


87 


111221 


N68869 


Hs.15119 


ESTs 


73 


111223 


N68921 


Hs.297939 


ESTs; Weakly similar to neogenin [H.sap 


9 


111229 


N69113 


Hs.110855 


ESTs 


89 


111241 


N69514 


Hs.288880 


ESTs; Weakly similar to CGI -82 protein I 


69 


111268 


N70481 


Hs.26118 


Homo sapiens done 24766 mRNA sequen 


45 


111295 


N73275 


Hs.21275 


ESTs; Weakly similar to ubiquitin-conjug 


56 


111299 


N73808 


Hs.24936 


ESTs 


8.5 


111336 


N79565 


Hs.29894 


ESTs 


67 


111357 


N91023 


Hs.87128 


ESTs 


15 


111370 


N92915 


Hs 94631 


brefeldin A-inhibited guanine nucteotide-e 


52 


111806 


R33468 


Hs.279008 


ESTs 


10 


111825 


R35885 


Hs.286146 


stromal antigen 1 


45 


111836 


R36228 


Hs.25119 


ESTs 


7.2 


111890 


R38678 


Hs 12365 


ESTs 


17.3 


111923 


R39995 


Hs 25925 


Homo sapiens clone 23860 mRNA sequen 


73 


111942 


R40576 


Hs.21590 


ESTs 


92 


111987 


R42036 


Hs 6763 


KIAA0942 protein 


10.6 


112101 


R44793 


Hs 296341 


adenylyl cyclase-associated protein 2 


53 


112134 


R46025 


Hs7413 


ESTs 


17.4 


112197 


R49482 


Hs.5637 


ESTs 


44 


112244 


R51309 


Hs 70823 


KIAA1077 protein 


11 


112253 


R51818 




Homo sapiens mRNA; cDNA DKFZp566 


93 


112305 


R54822 


Hs 26244 


ESTs 


4.4 


112449 


R63802 


Hs 124186 


ring finger protein 2 


63 


112483 


R66534 


Hs 285885 


ESTs 


49 


112519 


R68631 


Hs.11861 


ESTs 


14.3 


112610 


R79392 


Hs 23643 


ESTs 


52 


112693 


R88741 


Hs 91065 


ESTs; Moderately similar to proliferation 


46 


112751 


R93507 


Hs 8207 


ESTs 


56 


112801 


RQ74ftfi 
rwf MOO 


Hs 1S71R0 


nrntp'm kinasp* DNA-artivatpH* rafalvfir n 


87 


112869 


T03313 


Hs4747 


dyskeratosis congenita 1; dyskerin 


59 






nb. \c.COO 


ESTs 


58 


112908 


T10065 


Hs 3530 


TLS -associated serine-aipinine protein 


41 


1 12966 


T17119 




glucocorticoid receptor DNA binding fact 


5 7 


112971 


T17185 


Hs 83883 


ESTs 


6.4 


112995 


723528 


Hs.7155 


ESTs; Weakly simflar to TYKi protein [M 


9 1 


113047 


T25867 


Hs 7549 


ESTs 


54 


1 1 3075 


T34660 


Hs.6986 


ESTs; Weakly similar to !!!! ALU SUBFA 


5.7 


113117 


T47819 


Hs 159153 


ESTs 


5.8 


113206 


T58044 


Hs 241471 


ESTs; Moderately similar to HI! ALU SU 


64 


113248 


T63857 




yc16e1.s1 Stratagene lung (#93721) Homo 


46 




T64896 


Hs ?R7d?0 
no t-\>i ttu 


ESTs 


69 


1 11777 


TCC707 


Hs 11774 


nrntp'm /npntirlvl-nmlvt pLs/trarLS Lsnmpras 


56 


1 1 3278 


T65802 


Hs 11135 


yc11h10.s1 Stratagene lung (#937210) Ho 


4 5 




TRR191 
1 OO \L \ 


Hs 1Q144S 


ESTs 


6 4 


113523 


T90037 


Hs 95549 


ESTs 


64 


113604 


T92735 


Hs 296083 


ESTs 


8.7 


113702 


T97307 




ESTs; Moderately similar to !!!! ALU SU 


95 


113783 


W19222 


Hs.7041 


ESTs; Weakly similar to !i!l ALU SUBFA 


52 


113794 


W37382 


Hs.11090 


ESTs 


11,9 


113808 


W44735 


Hs.9286 


ESTs 


16.7 


113811 


W44928 


Hs.6994 


ESTs 


4 


113822 


W47350 


Hs.17466 


retinoic acid receptor responder (tazaroten 


48 


113823 


W47388 


Hs 55099 


rab6 GTPase activating protein (GAP and 


4 


113836 


W56792 


Hs 12040 


ESTs; Weakly similar to KIAA0881 prote 


41 


113857 


W65477 


Hs5297 


Homo sapiens mRNA; cDNA DKFZp564 


43 


113886 


W72471 


Hs 23920 


ESTs 


46 


113895 


W73738 


Hs 12921 


ESTs 


7 1 


113923 


W80763 


Hs 3849 


ESTs; Weakly simiar to FK506-bindlng p 


68 


113931 


VV81205 


Hs3496 


ESTs 


61 


113950 


W85765 


Hs 30504 


Homo sapiens mRNA; cDNA DKFZp434 


14 


113970 


W86748 


Hs.8109 


ESTs 


15 


114051 


W94942 


Hs 177534 


dual specificity phosphatase 10 


54 
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114057 


W96222 


Hs.34192 


ESTs 


4.8 


114086 


238266 


Hs.288649 


Homo sapiens PAC clone DJ0777O23 fro 


5.1 


114098 


238347 


Hs. 11 8338 


ESTs; Weakly similar to similar to S. cere 


6.2 


114109 


238435 


Hs.184108 


ribosomal protein L21 


4.6 


114124 


238595 


Hs. 12501 9 


ESTs; Highly similar to KIAA0886 prote 


22 


114138 


238763 


Hs. 15740 


amyloid beta (A4) precursor protein- bindi 


8.8 


114149 


238814 


Hs.27196 


ESTs 


4 


114162 


238909 


Hs.22265 


ESTs 


7.2 


114177 


239062 


Hs. 23740 


ESTs 


5.3 


114196 


239211 


Hs.150926 


fucose-1- phosphate guanytyltrarsferase 


4.4 


114208 


239301 


Hs 7859 


ESTs 


5.1 


114250 


239897 


Hs. 13297 


ESTs 


7.2 


114251 


239898 


Hs 21948 


ESTs 


14.7 


114292 


240715 


Hs.184641 


delta-6 fatty acid desaturase 


19.4 


114297 


240758 


Hs.173091 


DKF2P434K151 proten 


8.9 


114334 


241342 


H s .22941 


ESTs 


13.7 


114460 


AA024604 


Hs 961H2 


ESTs 


10.1 


114471 


AA028074 


He 104613 

fi;>, iinu s \j 


ESTs 


5,7 


114480 


AA032243 


Hs 151678 


U DP-N -acetyi-alpria-O-galactosamine; po 


7,3 


114518 


AA046407 


He 106469 


suppressor of varl (S.cerevisiae) 3-iike 1 


4.3 


114542 


AA055768 


Hs 293380 


ESTs 


11.7 


114549 AA056484 


He 292833 


ESTs 


7 3 


114652 


AA101416 


He 107149 

(13. IV/ I^s7 


ESTs; Weakly similar to PTB-ASSOCIAT 


6 1 


114673 


AA1 13303 


He <WiR3 


transmembrane 4 superfamily member (te 


4 3 


114698 


AA1 26951 


He 1 10AS7 
no. i ivjqj/ 


FST e' Hinhlv eimilar to nutativp DNIA-dir 


7 1 


114767 


AA148885 


He I^M.^ 
no. i *?*t*h+o 


mininhrnmrLenmp maintpnanfp ripficapnt 

11 HI \ \\A II Ul IIUOUII IC II la HI LCI Kkl UCI'WCI 11 \*J 


5 3 


114799 


AA1 59323 


He K1QQ9Q 


ESTs 


4 2 


114804 


AA160363 


He 2KQQS6 


ESTs 


48 


114811 


AA161161 


He Q6P07 


mnltinip inneitnl nnlvnhosohatp nhnsohata 


7.1 


114821 


AA165313 


He 55468 


ESTs 


4 4 


114852 


AA235035 


He Wfifi 


ESTs; Moderately similar to ubiquitin spe 


5 


114901 


AA236276 


He 1064^7 
i io. i *t of 


ESTe' Wpaklv eimilar to R26660 V Darti 


16.9 


114902 


AA236359 


He 39504 


ESTs 


5 1 


114940 


AA243012 


Hq 75Q28 


ESTs 


8 5 


114965 


AA250737 


Hs.72472 


ESTs 


35.1 


115047 


AA252627 


He 


IIUIIICV LAM D*J 


57 


115054 


AA252863 


He R772Q 
no. of f 


ESTs 


62 


115061 


AA253217 


Hs.41271 


ESTs 


13 


115082 


AA255557 


U_ 1QQOCQ 

ns. 130/03 


H1AI""\W HohuHr/vion!icD (nh'int iinnno\ 1 aln 

iwun uenyuroyciidSc ^uuiquinuiicj i dip 


25.2 


115116 AA256486 


He R207*i 


FQTc 
Co 1 5 


8 8 


115140 


AA258030 


ns.z/yyoo 


Co I o, VvcdlUy bllMMdF IU oUpjJUllcU Uj V3C 


4 1 


115205 


AA262470 


He 9A491R 




8 3 


115206 


AA262491 


He 1ftfi<\79 


FQTe 

CO 1 o 


5 1 


115239 


AA278650 


Wc- 711Q1 


CCTc- \hltmlf\\i elmilor similar tr\ th^i kv^t 
Lo 1 S, WcdRiy SllTlHdr 10 Similar 10 U1C Ucl 


4 6 


115242 


AA278755 


He OMT'l? 


FQTe 
CO 1 o 


8 3 


115249 


AA278961 


We 71 194 


FQTe 
CO 1 5 


10.1 


115259 


AA279071 


Wc 1^51 




9 5 


115285 


AA279799 


Wc WV7^fi 

ns. ijo/oo 


CO 1 b 


5 8 


115291 


AA279943 


He 199T7Q 


FQTc 

Co 1 5 


5 1 


115357 


AA281793 


Hi 72988 

no. r tjuu 


ESTs 


5 


115377 


AA282247 


Hs.193063 


ESTs 


61 


115400 AA283198 


Hs.89113 


ESTs 


4.9 


115439 


AA284561 


Hs.193090 


ESTs 


5.8 


115471 


AA287138 


Hs.59346 


ESTs; Weakly simiter to ASPARTYL-TR 


11.7 


115506 AA292537 


Hs.45207 


Human DNA sequence from clone 620E1 


6.8 


115522 


AA331393 


Hs.47378 


ESTs 


58 


115572 


AA398392 


Hs.59594 


ESTs; Weakly sim3ar to F33612.3 gene p 


97 


115587 


AA399264 


Hs.283037 


ESTs; Highly similar to HSPC039 protein 


87 


115600 


AA400247 


Hs.42173 


ESTs 


4 


115612 


AA400948 


Hs.71243 


ESTs; Weakly similar to zinc finger prote 


6.4 


115646 


AA404352 


Hs.305971 


ESTs 


5 3 


115652 


AA405098 


Hs.38178 


ESTs 


16.1 


115657 


AA405620 


Hs.55158 


ESTs; Weakly similar to weak similarity t 


4 7 


115658 


AA405625 


Hs. 183056 


Human DNA sequence from ctone 34821 


5.1 


115675 


AA406546 


Hs.82065 


Homo sapiens mRNA; cONA DKF2p564 


20.5 


115721 


AA417102 


Hs.90960 


ESTs 


4.8 


115763 


AA421560 




ESTs 


7 
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115764 AA421562 Hs.91011 anterior gradient 2 (Xenepus laevis) homo 416 

115835 AA428576 Hs.41371 ESTs 4.2 

115844 AA430124 Hs.7773 ESTs 119 

115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity 33 5 

5 115888 AA435839 Hs.76591 KIAA0887 protein 7 2 

115922 AA441911 Hs.71869 ESTs; Weakly similar to K1AA0926 prote 5.1 

115941 AA443602 Hs.46679 ESTs 4.8 

115947 AA443793 Hs.94761 ESTs 8.3 

115948 AA443798 Hs.43445 poly(A)-spedf]crfbonuclease (deaden ylat 13 5 
10 115951 AA443918 Hs.301048 cofilin 1 (non-muscle) 7.5 

115967 AA446887 Hs.42911 ESTs 8 8 

115984 AA447687 Hs.91109 ESTs 131 

116009 AA449448 Hs 44238 ESTs 5 5 

116024 AA451748 Hs.83883 Human DN A sequence from clone 718J7 7 5 

15 116028 AA452112 Hs 42644 thioredoxin-iike 12.7 

116050 AA453656 Hs 88417 ESTs 7.2 

116097 AA456099 Hs.176376 ESTs 11.8 

116108 AA457566 Hs.28777 ESTs 4.5 

116121 AA459254 Hs 48855 ESTs 4.5 

20 116127 AA459703 Hs.279884 v-myc avian myeiocytomatosts viral onco 4 3 

116129 AA459956 Hs.49163 ESTs; Highly similar to putative ribonude 7 6 

116142 AA460649 Hs 39457 ESTs 4 8 

116204 AA465701 Hs. 108646 ESTs 6 8 

116221 AA478397 Hs 50180 ESTs 4 9 

25 116222 AA478415 Hs.89986 ESTs 4 

116238 AA479362 Hs 47144 DKFZP 586 N08 19 protein 4 6 

116246 AA479961 Hs 250646 ESTs; Highly similar to ubiquitin-conjuga 4 

116249 AA480886 Hs 86693 ESTs 18 5 

116250 AA480975 Hs 44829 ESTs 10 8 
30 116254 AA481146 Hs41086 ESTs; Weakly similar to OXYSTEROL-B 91 

116256 AA481256 Hs88201 ESTs; Weakly similar to tysophospholipa 8 4 

116264 AA482594 Hs 272239 Homo sapiens mRN A; cDN A DKFZp586 7 2 

116265 AA482595 Hs 55189 ESTs; Weakly similar to F25B5.3 [C.eteg 11.1 
116282 AA486550 Hs 204501 ESTs; Weakly similar to Wiskott-Aldrich 6 2 

35 116298 AA489046 Hs 94109 ESTs 4 9 

116300 AA489194 Hs 159471 ESTs; Weak ry similar to snRNP protein B 4 6 

116327 AA490959 Hs 28005 Homo sapiens mRN A; cDN AD KF2p5 64 5 8 

116334 AA491457 Hs 48948 ESTs 4 3 

116337 AA496127 Hs 44070 ESTs 8 4 

40 116351 AA504116 Hs82501 Homo sapiens mRNA; cDN A DKFZp4 34 53 

116357 M504806 Hs 90797 Homo sapiens ckxie 23620 mRNA sequen 5 2 

116415 M609204 Hs 27973 K I AA0874 protein 6 6 

116443 AA620313 Hs 190488 ESTs; Weakly similar to KERATIN; TYP 45 

116470 C13992 Hs 83484 ESTs 4 5 

45 116480 C14088 glyceraWehyde-3-phosphate dehydrogena 5 6 

116578 D51272 Hs75337 nucleolar phosphoprotein p 130 4 1 

116579 D51276 Hs 81915 leukemia -associated phosphoprotein p1 8 5 8 
116626 F02028 Hs 81907 ESTs 4 9 
116647 F03069 Hs 15395 ESTs; Weakly similar to ARG I NYL-TRN 61 

50 116674 F04816 Hs92127 ESTs 10,6 

116680 F08813 Hs 273829 LINE retrotransposable element 1 4 2 

116700 F09983 Hs 317589 ESTs 13 

116724 F13665 Hs 65641 ESTs 8 5 

116726 F13681 Hs 53913 ESTs 5 6 

55 116732 F13779 Hs165909 ESTs 116 

116734 F13789 Hs 93796 DKF2P586D2223 protein 5 4 

116760 H11054 Hs 155342 protein kinase C; delta 4 3 

116780 H22566 Hs 30098 ESTs 5 7 

116786 H25836 Hs 301527 tumor necrosis factor (ligand) superfamily 8 8 

60 116787 H28581 Hs 15641 ESTs 8 6 

116790 H29532 Hs 101174 miaotubule^ssociated protein tau 22,2 

116803 H47357 Hs 109701 ESTs; Moderately similar to weak similar 6.7 

116877 H68116 Hs 168732 ESTs 6 5 

116921 H72948 Hs821 bigtycan 20.7 

65 117216 N20083 Hs 42792 ESTs 4 4 

117232 N20579 Hs.61153 ESTs 7 4 

117284 N22162 Hs 183779 ESTs; Weakly similar to cDN A EST yk33 4.1 
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117344 N24046 Hs.2 10706 ESTs 

117367 N24954 Hs.42502 ESTs 

117392 IM26175 Hs.93405 ESTs 

117394 N26257 Hs.39871 KIAA0727 protein 

5 117412 N26722 Hs.42645 ESTs 

117498 N31726 Hs.44268 ESTs; Highly similar to myelin gene expr 

117557 N33920 Hs.44532 diubiquitin 

117534 N36421 Hs.13323 ESTs; Weakly similar to SODIUM- AND 

117639 N36923 Hs.44833 ESTs 

10 117754 N47469 Hs.59757 ESTs 

117852 N49408 Hs.136102 K1AA0853 protein 

117879 N50050 Hs.303025 ESTs; Weakly similar to keratin; 67K typ 

117924 N51056 Hs.38891 ESTs 

117950 N51394 Hs.75478 KJAA0956 protein 

15 117992 N5200Q Hs.172089 Homo sapiens mRN A; cDN A DKFZp586 

118138 N57773 Hs.93560 ESTs; Weakly similar to Irg [R. no rvegicu 

118215 N62195 Hs.77910 3iiydroxy-3-methylglutaryl-Coenzyrne A 

118229 N62339 Hs.166254 heat shock 90kD protein 1; alpha 

118265 N62827 Hs.48645 EST 

20 118336 N63604 Hs.47166 ESTs 

118363 N64168 Hs.48938 ESTs 

118429 N66158 Hs.74649 ESTs 

118470 N66769 Hs.291033 ESTs 

118472 N66818 Hs.42179 ESTs 

25 118475 N66845 ESTs; Weakly similar to 1!!! ALU CLASS 

118493 N67149 Hs.50115 ESTs 

118528 N67889 Hs.49397 ESTs 

118542 N68010 Hs.49427 ESTs 

118600 N69222 ESTs 

30 118695 N71781 Hs.50081 Homo sapiens mRNA full length insert cD 

118698 N72113 Hs.50187 ESTs 

118901 N90719 Hs.94445 ESTs 

118952 N92966 ESTs; Highly similar to CGl-90 protein [H 

118976 N93629 Hs.93391 ESTs 

35 118986 N94362 Hs.125830 ESTs 

118989 N94439 Hs.45105 ESTs 

119027 N99256 Hs.114611 ESTs 

119042 R05316 Hs.5472 ESTs 

119075 R36451 Hs.287820 fibronectin 1 

40 119260 T15916 Hs.102950 ESTs; Highly similar to coat protein gamm 

119271 T16387 Hs.55328 ESTs 

119298 T23820 Hs,155478 cyclin T2 

119302 T25725 ESTs 

119341 T62571 Hs.146388 microtubulenassociated protein 7 

45 119495 W35390 Hs.55533 ESTs 

119580 W42451 Hs.92260 high-mobility group protein 2-I ike 1 

119602 W46286 Hs.233694 ESTs; Weakly similar to ZK 1058.5 [C.ele 

119620 W47620 Hs.56009 2'-5'oligoadenylate synthetase 3 

119676 W60473 Hs.57787 ESTs 

50 119717 W69134 Hs.57987 ESTs 

119729 W69747 Hs.94806 KIAA1 062 protein 

119805 W73788 Hs.43213 ESTs 

119859 W80702 Hs.58461 ESTs 

119867 W80852 Hs,250696 KDEL (Lys-Asp-Glu-Leu) endoplasmic re 

55 119873 W81129 Hs.44865 Homo sapiens mRNA; cDNA DKFZp586 

119899 W84767 Hs.58698 ESTs 

119940 W86779 Hs.272531 DKFZP586B0319 protein 

119943 W86835 Hs,14158 copine ill 

119970 W87812 Hs.93581 Homo sapiens mRNA; cDNA DKFZp586 

60 120131 Z38656 Hs.75887 coatorner protein complex; subunit alpha 

120150 Z39549 Hs.1 53746 ESTs 
120206 Z40805 Hs.91668 ESTs 
120241 Z41815 Hs.65946 ESTs 

120255 AA169752 Hs 5672 ESTs; Weakly similar to Similarity to Yea 
65 120314 AA194166 Hs 221040 KIAA1038 protein 
120325 AA195651 Hs 104106 ESTs 
120352 AA211400 Hs,193172 ESTs 
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120428 AA236822 Hs.173694 KIAA1 097 protein 5.6 

120524 AA261852 Hs,192905 ESTs 5.6 

120528 AA262107 Hs.104413 ESTs 4.5 

120571 AA280738 Hs.34892 ESTs 4.9 

5 120649 AA287115 Hs.192843 ESTs 4.5 

120655 AA287347 Hs.238205 ESTs 6.7 

120668 AA287833 Hs.292913 ESTs 8.3 

120712 AA292654 Hs.102506 eukaryotic translation initiation factor 2 al 4.6 

120713 AA292655 Hs.96557 ESTs 10,6 
10 120724 AA293470 Hs.1 00747 ESTs 5.4 

120873 AA358015 EST 7.1 

120885 AA365515 Hs.301872 ESTs; Moderately similar to Iff! ALU SU 4.6 

120919 AA381125 Hs.301444 ESTs 8.2 

120948 AA397822 Hs.104650 ESTs; Highly similar to similar to mag on 8.6 

15 120969 AA398116 Hs.129206 casein kinase 1; gamma 3 10.5 

120977 AA398155 Hs.97600 ESTs 10.9 

121103 AA398936 Hs.97697 EST 7.4 

121291 AA401753 Hs.8186 lung cancer candidate 5.3 

121320 AA403OO8 Hs.301927 T-cell receptor, alpha (V;D;J;C) 13.5 

20 121463 AA411745 Hs.239681 ESTs; Weakly similar to KIAA0554 prote 8.9 

121596 AA416740 Hs 174104 ESTs 22.6 

121723 AA419622 Hs.104800 ESTs; Weakly similar to Mouse 19.5 mRN 8 

121748 AA421171 Hs.234545 ESTs 5.6 

122125 AA434411 Hs.98806 ESTs 5.3 

25 122522 AA449444 Hs.98969 ESTs 4 

122655 AA454756 Hs.97837 ESTs 4 

122704 AA456326 Hs.99445 ESTs 6.2 

122782 AA459894 Hs 99472 ESTs 5.3 

122856 AA463740 Hs.75367 Src-Jike-adapter 13.1 

30 122882 AA465381 Hs.108812 ESTs; Weakly similar to B0041.5 [C.eleg 5.5 

122928 AA476578 Hs. 101 840 ESTs 6,3 

122974 AA478625 Hs.194215 ESTs 6 

122997 AA479295 Hs.106290 Kelch motif containing protein 12.5 

123016 AA480103 Hs.323231 ESTs; Weakly similar to alternatively spli 4.4 

35 123107 AA486071 Hs. 104207 ESTs 8.3 

123111 AA486273 Hs 191721 ESTs 4.2 

123114 AA486407 Hs.129928 ESTs; Moderately simitar to KIAA0454 p 5.2 

123136 AA487449 Hs.194024 ESTs 4.2 

123137 AA487468 Hs.100686 ESTs; Weakly similar to secreted cement 14.6 
40 123169 AA488892 ESTs; Weakly similar to Gag-Pol porypro 4.5 

123176 AA489020 Hs.69233 ESTs 5.2 

123338 AA504249 Hs 187585 ESTs 4 

123436 AA598714 Hs.223014 protease; serine; 15 7.3 

123442 AA598803 Hs.1 11496 ESTs 5.9 

45 123449 AA598899 Hs.1 12493 Homo sapiens mRNA; cDN A DW : Zp564 4.1 

123494 AA599786 Hs 112110 ESTs 4 

123503 AA600121 Hs.293156 ESTs 12.8 

123533 AA608751 ESTs; Weakly similar to !!!! ALU SUBFA 7.9 

123619 AA609200 ESTs 23,1 

50 123673 AA609471 Hs.1 58549 ESTs 6 6 

123729 AA609778 Hs 278672 membrane component; chromosome 11; s 4.7 

123819 AA620636 Hs 112264 ESTs 4 

123960 AA621785 Hs 287733 methylmaJonate-semiafderryde dehydroge 7 6 

124000 D573 17 Hs 74861 activated RN A polymerase II transcription 4.4 

55 124006 D60302 Hs.270016 ESTs 20.6 

124012 D80240 Hs 241471 HUM5G1 1A Human fetal brain (TFujiwa 6.7 

124021 F02859 Hs 13974 ESTs 47 

124049 F10523 Hs 74519 primase; polypeptide 2A (58kD) 47 

124059 F13673 Hs 283713 ESTs 7 7 

60 124243 H66710 Hs.1 33525 ESTs 5.5 

124308 H93575 Hs 241507 Homo sapiens mRNA; cDNA DKFZp564 11.4 

124314 H94877 Hs 215766 GTP-binding protein 13.7 

124315 H94892 Hs.288757 v-ral simian leu kemia viral oncogene horn 14 
124350 N21359 Hs 101282 Homo sapiens mRNA; cDNA DKFZp434 8.6 

65 124352 N21626 Hs 102406 ESTs 7 2 

124357 N22401 yw37g07.s1 Morton Fetal Cochlea Homo 5.2 

124390 N29325 Hs7535 ESTs; Highly similar to COBW-Cke place 7 9 
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124438 N40188 Hs.11090 ESTs 

124447 N48000 Homo sapiens mRNA; cDN A DKFZp586 

124457 N50114 Hs.266175 ESTs 

124539 N63172 Hs.146409 cell division cycle 42 (GTP- binding prote 

5 124626 N74604 Hs.11090 ESTs 

124632 N79515 Hs.306117 interteukin 13 receptor; alpha 1 

124644 N91279 Hs.109654 ESTs; Moderately similar to outer membr 

124676 R01037 Hs.181013 phosphogty cerate mutase 1 (brain) 

124677 R01073 ESTs; Weakty similar to !!!! ALU CLASS 
10 124724 R12405 Hs.112423 Homo sapiens mRNA; cDNA DKFZp586 

124773 R40923 Hs. 106604 ESTs 

124777 R41933 ESTs 

124792 R44357 Hs.48712 ESTs; Weakly similar to cDNA EST EMB 

124857 R63652 Hs. 137190 ESTs 

15 124911 R88992 Hs.180612 ESTs 

124955 T10598 Hs.324841 ESTs; Weakly similar to !!!! ALU SUBFA 

124958 T11134 Hs.431 murine leukemia viral (bmM) oncogene h 

125038 T78089 Hs 270134 ESTs 

125092 T92544 Hs,137546 CD84 antigen (leukocyte antigen) 

20 125132 W15495 Hs.129781 chromosome 21 open reading frame 5 

125144 W37999 Hs 24336 ESTs 

125154 W38419 ESTs 

125243 W86423 Hs 105413 ESTs 

125279 W93640 Hs 4779 ESTs; Moderately similar to similar to AD 

25 125299 Z39436 Hs 102720 ESTs 

125303 Z39821 Hs.288193 ESTs 

125304 Z39833 Hs, 124940 GTP-binding protein 

125474 AA151216 Hs.75103 tyrosine 3-monooxygenase/tryptophan 5-m 

125509 AA044232 Hs, 288967 ESTs 

30 125580 AA126504 Hs 267812 sorting nex in 4 

125582 AA507383 Hs.74649 cytochrome c oxidase subun'rt Vic 

125670 AI432621 Hs.82685 CD47 antigen (Rh-related antigen; integri 

125698 AA748483 Hs.191356 general transcription factor IIH; polypepti 

125745 AI283493 Hs75722 ribophorin II 

35 125852 H09290 Hs76550 Homo sapiens mRNA; cDNA DKFZp564 

125972 AA434562 Hs.35406 ESTs 

126160 N90960 Hs.265398 ESTs; Weakly similar to transformations 

126257 N99638 Hs. 124084 tumor necrosis factor receptor superfamify 

126337 AI066486 Hs.40500 similar to S. cerevisiae RER1 

40 126405 U46278 Hs.122489 ESTs 

126537 W40262 Hs.146310 ESTs; Weakly similar to putative p150 [H 

126590 W78968 Hs.181307 H 3 histone; family 3A 

126712 AA205862 Hs.7942 ESTs 

126721 T72569 Hs.125359 Thy-1 cell surface antigen 

45 126764 AI334393 Hs.102178 ESTs 
126804 AI203334 Hs.160628 ESTs 
126819 AA305536 Hs.279607 ESTs 
126877 AI052047 Hs.26102 ESTs 
126991 R31652 Hs.821 biglycan 

50 127479 AA513722 Hs.179729 collagen; type X; alpha 1 (Schmid metaph 
127514 AA826926 Hs.204214 ESTs 

127663 W07286 Hs.10340 ESTs; Weakly similar to weak similarity t 
127677 AA916752 Hsi64190 ESTs; Highly similar to MEM3 [M.muscu 
127814 AA761755 Hs.1 36713 ESTs; Weakly similar to V4-1 [H.sapiens 
55 127997 A1281549 Hs.311054 ESTs 
128092 AA904617 Hs.166229 ESTs 

128218 H02682 Hs.292154 ESTs; Moderately similar to recombinatio 
128466 D59653 Hs.241471 EST 
128482 U 83908 Hs.296251 programmed cell death 4 
60 128517 AA280617 Hs.100861 ESTs; Weakly similar to p60 katanin [H.S 
128530 AA504343 Hs. 183475 Homo sapiens clone 25061 mRNA sequen 
128559 AA226801 Hs.101448 metastasis associated 1 
128574 AA412048 Hs.38260 keratin 8 

128595 U31875 Hs.152677 snort-chain alcohol dehydrogenase family 
65 128610 L38608 Hs. 10247 activated leucocyte cell adhesion molecule 
128629 AA399187 Hs.102708 DKFZP434A04 3 protein 
128649 AA142853 Hs.103106 Homo sapiens mRNA for G7b protein (G 
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128651 AA446990 Hs.103135 ESTs 6.1 

128653 R48943 Hs 10315 solute carrier family 7 (cationic amino aci 4.4 

128656 AA458542 Hs.10326 coatomer protein complex; su bun it epsi Ion 14.3 

128717 T30617 Hs. 104222 Homo sapiens mRNA. cDNA DKFZp566 24.5 

5 128727 M64174 Hs 50651 Janus kinase 1 (a protein tyrosine kinase) 7 3 

128764 N49308 Hs.104938 ESTs; Weakly similar to alpha 1 (XVI II) c 9 2 

128781 X85372 Hs 105465 small nuclear ribonucleoprotein poly pepti 5 4 

128793 W93562 Hs. 105749 KIAA0553 protein 4 6 

128835 W15528 Hs.106390 Homo sapiens mRNA. cDNA DKF2p586 4 

10 128845 AA455658 Hs 10649 basement membrane- induced gene 69 

128871 AA400271 Hs. 106778 Homo sapiens mRNA for putative Ca2+-t 4 5 

128922 AA252023 Hs9589 ESTs; Weakly similar to HRIHFB21 57 [H 64 

128925 D61676 Hs 21851 Homo sapiens mRNA, cDN A DKF2p586 64 

128938 AA410325 Hs.107260 ESTs 7 

15 128946 N29353 Hs. 107318 kynurenine 3-monooxygenase (kynurenin 5 2 

128948 AA485655 Hs 223025 proteasome (prosome; macropain) subunit 13.1 

128955 F10290 Hs.185807 Homo sapiens clone 24758 mRNA sequen 5 8 

129005 AA460049 Hs 13323 ESTs; Weakly similar to SODIUM- AND 12.6 

129009 AA131421 Hs 75607 ESTs 98 

20 129017 H13108 Hs.107968 ESTs 13.9 

129057 X62466 Hs 276770 CDW52 antigen (CAMPATH-1 antigen) 10.7 

129075 AA129465 Hs.83765 ESTs 4 7 

129095 L12350 Hs 108623 thrombospondin 2 44 

129124 AA234530 Hs.108802 N -ethy I maleimide-sensi five factor 20,7 

25 129160 AA131252 Hs. 109007 ESTs 5 9 

129164 AA282183 Hs.109045 ESTs 5 8 

129180 R40556 Hs. 318401 ESTs; Highly similar to HSPC039 protein 76 

129224 X89109 Hs.109606 coronin; actin-binding protein; 1A 12 

129229 AA2 11 941 Hs 109643 polyadenylate binding protein-interacting 7 9 

30 129240 W24360 Hs.237868 interteukin 7 receptor 5 3 

129241 AA435665 Hs 109706 ESTs; Moderately similar to HN1 [M.m us 84 

129243 H88033 Hs 109727 K I AA07 33 protein 7 8 

129247 AA151574 Hs 109733 pilin-like transcription factor 64 

129259 AA090695 Hs 181385 ESTs 62 

35 129270 Z35227 Hs 109918 ras homotog gene family; member H 5 4 

129281 AA026318 Hs 289101 glucose regulated protein; 58kD 4 4 

129300 C20976 Hs 110165 ESTs; Highly similar to ribosomal protein 57 

129318 N93155 Hs 285976 calmodulin 1 (phosphorylase kinase; delta 7 7 

129319 AA037467 Hs30340 ESTs 6 
40 129351 AA167268 Hs 62349 Human ras inhibitor mRNA; 3' end 9 3 

129366 H18027 Hs 184697 plexind 18.2 

129383 W92984 Hs 288224 ESTs 59 

129388 AA151621 Hs 110964 ESTs 41 

129391 T80814 Hs 11101 discs; large (Drosophila) homolog 3 (neur 10.9 

45 129404 AA172056 Hs.317584 ESTs 5 3 

129406 N23707 Hs 111138 KIAA07 12 gene product 4 

129426 AA412087 Hs 111323 EST; Highly similar to protein inhibitor o 8 

129453 AA421213 Hs 111632 Lsm3 protein 5 5 

129513 C00225 Hs306163 ESTs; Weakly similar to fos39554_1 [H.s 55 

50 129519 AA298786 Hs 112242 ESTs 6 8 

129606 R21443 Hs 166254 heat shock 90kD protein 1; alpha 5 

129622 AA278243 Hs 323949 ESTs 6 8 

129626 AA447410 Hs 111334 ESTs; Weakly similar to !!!l ALU SUBFA 51 

129627 AA258308 Hs 71968 Homo sapiens mRNA, cDNA DKFZp564 5 3 
55 129528 U26727 Hs 1174 cyclin dependent kinase inhibitor 2A (mel 8 2 

129642 R50008 Hs.11B06 7-dehydrocholesterol reductase 4 3 

129663 AA442768 Hs 11866 translocase of inner mitochondrial membr 4 4 

129665 M88458 Hs 118778 KDEL (Lys-Asp-Glu-Leu) endoplasmic re 4 

129691 X06700 Hs 119571 collagen; type III; alpha 1 (Ehlers-Danlos 6 

60 129783 AA45461 8 Hs 12479 associated molecule with the SH3 domain 6 4 

129800 AA252436 Hs 12540 tysophospholipase I 7 7 

129836 AA452161 Hs 206521 YME1 (S.cerevisiae)-like 1 5 

129850 N20593 Hs 288932 GDP dissociation inhibitor 2 6 9 

129869 AA102520 Hs 13015 ESTs; Weakly similar to heat shock prote 5 

65 129896 AA043021 Hs 13225 UDP-Gal:betaGlcNAcbeta 1;4- galactosy 66 

129982 M87789 immunoglobulin gamma 3 (Gm marker) 4 

129985 AA450045 Hs 140452 cargo selection protein (mannose 6 phosp 5.8 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



130029 AA236412 
130033 M90696 
130036 Ml 95260 
130069 AA056896 
130077 T24055 
130080 X14850 
130096 AA223874 
130114 AA234717 
130125 M36803 
130135 M61764 
130170 AA610070 
130189 D43947 
130208 AA620556 
130211 D50840 
130235 X14046 
130276 S75295 
130280 L13738 

130313 AA620323 

130314 D86967 
130328 AA135673 
130356 X84373 
130367 238501 
130378 T47333 
130384 X66364 
130393 D13630 
130399 M449417 
130407 N29888 
130414 M21121 
130417 U58522 
130421 D21260 
130441 U35835 
130455 X17059 

130498 L38951 

130499 AA416723 
130511 L32137 
130553 AA430032 
130558 H96654 
130568 AA232535 
130583 W24957 
130585 H66211 
130604 X03635 
130614 AA132007 
130619 AA477739 
130622 AA235247 
130625 F03969 
130627 L23808 
130629 M60346 
130635 M87503 
130639 D59711 
130677 H17861 
130681 D82808 
130693 AA487202 
130703 N63295 
130706 AA488843 
130712 AA292066 

130714 X92896 

130715 T98227 
130744 AA203527 
130747 AA471293 
130751 AA435633 
130796 R39390 
130800 AA223386 
130855 AA425439 
130859 AA287327 
130866 M58028 
130880 D14678 
130891 D31891 



Hs.236510 
Hs.181301 
Hs.1 25849 
Hs.146428 
Hs.91379 
Hs. 147097 
Hs.197955 
Hs.,14992 
Hs.,1504 
Hs.21635 
Hs.151469 
Hs.151761 
Hs.15250 
Hs.23703 
Hs.153053 
Hs.169149 
Hs. 153937 
Hs.154320 
Hs.154332 
Hs.154668 
Hs.155017 
Hs.8768 
Hs. 1551 88 
Hs,166071 
Hs.155291 
Hs. 155356 
Hs.155410 
Hs.241392 
Hs.155485 
Hs.178710 
Hs. 155637 
Hs.155956 
Hs.180446 
Hs,158286 
Hs.1584 
Hs.252587 
Hs.15984 
Hs. 16085 
Hs.293907 
Hs.16331 
Hs1657 
Hs 16697 
Hs 12532 
Hs 16846 
Hs.260720 
Hs.1695 
Hs 1697 
Hs.1706 
Hs 17132 
Hs 17767 
Hs.17820 
Hs 17962 
Hs 18103 
Hs 201673 
Hs 279762 
Hs 18212 
Hs. 171952 
Hs 18747 
Hs.6879 
Hs 18879 
Hs.19525 
Hs 19574 
Hs 143323 
Hs.20478 
Hs.2055 
Hs.20830 
Hs.20991 



ESTs; Moderately similar to PFT27 [M.m 
cathepstn S 

ESTs; Moderately similar to l!!t ALU SU 

collagen; type V; alpha 1 

ribosomal protein L26 

H2A histone family, member X 

KIAA0704 protein 

ESTs 

hemopexin 
tubulin; gamma 1 

calcium/caimodutirMtependent serine pro 
KIAA0100 gene product 
peroxisomal D3;D2-enoyl-CoA teomerase 
UDP-giucose ceramkie glucosyltransferas 
CD37 antigen 

karyopherin alpha 1 (importin alpha 5) 

activated p21cdc42Hs kinase 

ubiquitin-activating enzyme E1C (homolo 

KIAA0212gene product 

KIAA0391 gene product 

nuclear receptor interacting protein 1 

ESTs; Weakly similar to !l!f ALU SUBFA 

TATA box binding protein (TBP)-associa 

cyclin-dependent kinase 5 

KIAA0005 gene product 

Homo sapiens mRNA for putative glucosy 

ESTs 

small inducible cytokine A5 (RANTES) 
hunHngtin-interacting protein 2 
dathrin; heavy pdypeptide-like 2 
protein kinase; DNA-activated; catalytic p 
N-acetyltransf erase 1 (arylamine N^cetyl 
karyopherin [importin) beta 1 
Homo sapiens mRNA for KIAA0446 prot 
cartilage oligomeric matrix protein (pseud 
pituitary tumor-transformtng 1 
ESTs; Weakfy similar to gene pp21 protei 
ESTs; Highly similar to C6I-1 3 protein [H 
ESTs; Moderately similar to similar to C.e 
ESTs 

estrogen receptor 1 

ESTs 

ESTs 

ESTs; Weakly similar to cytochrome P45 
matrix metalloproteinase 2 (gelatinase A; 
matrix metalloproteinase 12 (macrophage 
ATPase; H+ transporting; lysosomal (vacu 
interferorvstimutated transcription factor 
ESTs 
ESTs 

Rho-assoctated; coiled-coil containing pro 

ESTs 

ESTs 

cornichon-like 
adenylate cyclase 7 

DNA segment on chromosome X (unique) 
occludin 

POP7 (processing of precursor, S. cerevis 
ESTs 

Homo sapiens cone 23965 mRNA sequen 
ESTs 

ESTs; Weakly similar to katanin p80 subu 
putative DNA/chromatin binding motif 
ceroid-lipofuscinosis; neuronal 2; late infa 
ubiquitin-activating enzyme E1 (A1S9T a 
kinesirvlike 2 
SET domain; bifurcated; 1 
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130905 M056489 Hs 129998 ESTs 8.7 

130913 W03592 Hs 21198 translocase of outer mitochondrial me mbr 20.9 

130919 AA291710 Hs.21276 collagen; type IV; alpha 3 (Goodpasture a 9 

130921 AA074596 Hs.194688 bromodomain adjacent to zinc finger dom 5.3 

5 130944 M97935 Hs.21486 signal transducer and activator of transcrip 18.8 

130974 X579B5 Hs 217B H2B histone family; member Q 13.4 

130987 R45698 Hs 21893 ESTs; Weakly similar to cAMP inducible 8.5 

130999 N48963 Hs 21992 KIAA06 89 protein 7.2 

131010 AA435748 Hs 169341 ESTs; Weakly similar to phosphatide acid 5.2 

10 131046 X02530 Hs.2248 small inducible cytokine subfamily B (Cy 10.1 

131091 T35341 Hs 22880 ESTs; Highly similar to dipeptidyl peptid 6 3 

131153 H11760 Hs.23606 ESTs 7.3 

131185 M25753 Hs 23960 cyclinBI 6 2 

131200 AA609427 Hs 293732 ESTs; Moderately similar to !!!! ALU SU 4 3 

15 131206 AA044078 Hs.24210 ESTs 5 5 

131210 AA430047 Hs95549 ESTs 7.1 

131227 AA429472 Hs.236522 DKFZP434P1 06 protein 5 6 

131244 D38076 Hs.24763 RAN binding protein 1 5.5 

131245 AA620599 Hs.24766 DKFZP564E1 962 protein 6 7 
20 131257 AA256042 Hs24908 ESTs 58 

131319 U25997 Hs.25590 stanniocaldn 89 

131339 AA463450 Hs.25812 Nljmegen breakage syndrome 1 (nib rin) 6 5 

131388 R34531 Hs 92200 KIAA04 80 gene product 9 2 

131410 H 84658 Hs.279836 ESTs 12.1 

25 131472 AA608962 Hs 27258 calcyclin binding protein 18.1 

131475 Z39053 Hs.27263 ESTs 7 5 

131501 AA121127 Hs.8207 H3 histone; family 3A 5 5 

131514 X02152 Hs.2795 lactate dehydrogenase A 51 

131524 N39152 Hs.301804 ESTs 4 3 

30 131528 D60856 Hs.28309 UDP-glucose dehydrogenase 8 4 

131544 N33236 Hs 28555 ESTs; Weakly similar to BOS 11.8 [C.eleg 5 6 

131557 D30946 Hs.28707 signal sequence receptor; gamma (transloc 8 7 

131562 U90551 Hs.28777 H2A histone family; member L 18.8 

131564 AA491465 Hs 28792 ESTs 11.8 

35 131586 AA235385 Hs 25966 ESTs; Moderately similar to alternatively 4 7 

131587 M15182 Hs.183868 glucuronidase; beta 52 

131589 U52100 Hs 29191 eprthelfal membrane protein 2 4 4 

131615 D14533 Hs.192803 xeroderma pigmentosum; complementatio 4 6 

131664 AA136126 Hs 30327 mitoge n activated protein kinase-activated 4 3 

40 131679 AA136660 Hs 30579 ESTs 9 4 

131684 U 26174 Hs.3066 granzyme K (serine protease; granzyme 3 9 7 

131687 L11066 Hs 3069 heat shock 70kD protein 9B (mortal in-2) 6 2 

131689 AA599653 Hs 30696 transcription factor-like 5 (basic helix-bo 8 3 

131693 W60913 Hs 110796 ESTs; Weakly similar to cDN A EST yk45 9 

45 131710 AA233225 Hs.30985 MRS 1 protein 5 2 

131716 D49738 Hs 31053 cytoske leton- associated protein 1 6 6 

131742 D31352 Hs 31433 ESTs 11 

131762 H46831 Hs 107767 ESTs; Moderately similar to CaM-KII inn 4 9 

131781 AA460450 Hs 31989 DKFZP586G1 722 protein 9 2 

50 131795 N32724 Hs 32317 Sox-like transcriptional factor 4 5 

131809 L76517 Hs 3260 presenilin 1 (Alzheimer disease 3) 5 4 

131814 AA437226 Hs157 interteu kin 10 receptor; alpha 4 

131838 AA091932 Hs 180628 dynamin-like protein 6 7 

131877 J04088 Hs 156346 topoisomerase (DNA) II alpha (170kD) 5 

55 131885 AA044095 Hs3402 ESTs 11.1 

131891 AA1 58258 Hs.30376 heterogeneous nuclear protein similar to r 5 6 

131925 AA248470 Hs 183180 ESTs; Weakly similar to RING finger pro 4 5 

131930 AA205460 Hs.69476 ESTs 14.3 

131941 D62657 Hs 35086 ubiquitin-specific protease 1 6 2 

60 131965 W90146 Hs35962 ESTs 6.3 

131970 D86960 Hs 3610 KIAA0205 gene product 4.2 

131971 R70167 Hs 154938 ESTs 4.3 
131974 AA410424 Hs 268122 Homo sapiens mRNA; cDNA DKFZp586 4.6 
131977 F09788 Hs3622 procollagen -proline; 2-oxoglutarate 4 -diox 6.4 

65 131994 AA479515 Hs 279882 Human DNA sequence from clone 703H1 12 

131997 D82399 Hs. 136644 Homo sapiens clone 23714 mRNA sequen 10 

132017 W67251 Hs.267659 Homo sapiens vav 3 oncogene (VAV3) m 4.7 
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132021 T68246 Hs.306079 diaperonin containing TCP1; subunit 5 (e 

132065 D82226 Hs.211594 p rotea some (prosome; macro pain) 26S sub 

132085 D44466 Hs.3887 proteasome (pfosome; macropain) 26S sub 

132089 AA131971 Hs.39122 ESTs 

5 132109 AA599801 Hs.40098 ESTs 

132143 AA257056 Hs.7972 KIAA0871 protein 

132149 T10822 Hs.324743 ESTs 

132153 N90141 Hs.41066 ESTs; Moderately similar to ELONGAT I 

132160 AA281770 Hs.295923 seven in absentia (Drosophila) homolog 1 

10 132164 U84573 Hs.41270 procollagen-lysine; 2-oxog I utarate 5-dioxy 

132180 AA405569 Hs.418 fibroblast activation protein; alpha; sepras 

132183 L19183 Hs.199695 hypothetical protein 

132225 AA128980 ESTs 

132227 AA412620 Hs.4248 ESTs 

15 132235 F09058 Hs.42656 ESTs 

132256 AA608856 Hs.431 murine leukemia viral (bmM) oncogene h 

132298 N41849 Hs.7120 Homo sapiens cytokine receptor related p 

132314 AA285290 Hs.44499 small EDRK-rich factor 2 

132325 N37065 Hs.44856 ESTs 

20 132384 AA479933 Hs.46967 Human DNA sequence from clone 167A1 

132387 R70914 Hs.281434 heat shock 70 kD protein 1 

132393 W85888 Hs.47334 ESTs; Moderately similar to !!!! ALU SU 

132406 F09979 Hs.4774 ESTs 

132407 AA431459 Hs.47783 ESTs 

25 132413 AA132969 Hs.260116 KIAA1 104 protein 

132446 AA426218 Hs.48764 ESTs 

132465 AA047896 Hs.49169 ESTs 

132482 AA429478 Hs.238126 ESTs; Highly similar to CGI49 protein [H 

132492 T03749 Hs.4990 KIAA1 089 protein 

30 132528 AA283006 Hs.50758 chromosome-associated polypeptide C 

132540 AA488987 Hs.5097 synaptogyrin 2 

132543 AA417152 Hs.5101 protein regulator of cytokinesis 1 

132580 L37042 Hs.283738 casein kinase 1; alpha 1 

132586 AA4 12452 Hs 52515 DKFZP434N024 prote'm 

35 132608 AA199588 Hs,5321 ARP3 (a ctin -related protein 3; yeast) horn 

132616 AA386264 Hs.283558 isocitrate dehydrogenase 2 (NADP+); mit 

132617 AA171913 Hs.5338 carbonic an hydrase XII 

132618 AA253330 Hs.279916 adaptor-related protein complex 1; gamma 
132640 U33821 Hs. 5437 Taxi (human T-cell leukemia virus type I 

40 132668 AA453614 Hs.5460 KIAA077 6 protein 

132694 M60830 Hs.5509 ecotropic viral integration site 2B 

132700 N47109 Hs.5521 ESTs 

132724 AA417962 Hs.55498 geranylgeranyt diphosphate synthase 1 

132738 W42674 Hs, 264636 ESTs; Moderately similar to neuronal thre 

4 5 1 32742 AA490862 Hs.29281 2 ESTs; Weakly similar to C43H8.1 [C.eleg 

132744 X54326 Hs.55921 glutamyl-prolyl-tRNA synthetase 

132795 H99152 Hs.57079 ESTs 

132807 AA331777 Hs.57301 mul (E.coli) ho mo log 1 (colon cancer; n 

132811 U25435 Hs.57419 transcriptional repressor 

50 132817 AB004884 Hs.57553 tousled- 1 ike kinase 2 

132840 N23817 Hs.5807 Homo sapiens clone 23675 mRNA sequen 

132845 D62588 Hs.5813 ESTs 

132847 T48195 Hs.58189 eukaryotic translation inrtiation factor 3; s 

132856 W79865 Hs.58367 grypican4 

55 132869 N26855 Hs.203961 ESTs 

132874 AA425776 Hs.58609 ESTs 

132880 AA444369 Hs.177537 ESTs 

132894 D82422 Hs.5944 ESTs 

132900 N 56451 Hs.5978 UM domain only 7 

60 132903 AA235404 Hs.5985 Homo sapiens clone 25186 mRNA sequen 

132904 X83618 Hs.59889 3-hydroxy-3-methylglutaryl-Coenzyme A 

132906 AA142857 Hs.234896 ESTs; Highly similar to geminin [H.sapie 

132914 AA496037 Hs.60293 ESTs 

132918 AA252605 Hs.6051 KIAA06 16 protein 

65 132936 AB002305 Hs.6111 K I AA0307 gene product 

132951 U04209 Hs.61418 microftbrlllar-associated protein 1 

132957 AA234791 Hs.61469 Human gene from PAC 753P9; chromoso 
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132959 


AA028103 


Hs.61472 


ESTs; Weakly similar to unknown [S.cere 


18.9 


132968 


N77151 


Hs.61638 


myosin X 


5.8 


132984 


H80409 


Hs.62112 


zinc finger protein 207 


4.3 


132990 


AA458761 


Hs.18387 


transcription factor AP-2 alpha (activating 


4.2 


132994 


AA505133 


Hs.279905 


solute carrier family 2 (facitrtated glucose 


26.4 


132996 


Y0O062 


Hs.170121 


protein tyrosine phosphatase; receptor typ 


4.4 


133002 


AF006082 


Hs.42915 


ARP2 (actin-related protein 2; yeast) horn 


4.7 


133005 


C21400 


Hs.278605 


KIAA0970 protein 


5.6 


133015 


AA047036 


Hs.246315 


ESTs 


7.9 


133016 


W81298 


Hs.6289 


growth factor receptor-bound protein 2 


5.2 


133039 


X62055 


Hs.63489 


protein tyrosine phosphatase; non-recepto 


4 


133050 


S67325 


Hs.63788 


proptonyl Coenzyme A carboxylase; beta 


5.2 


133056 


AA071387 


Hs.6396 


jumping translocation breakpoint 


5 


133062 


R33663 


Hs.64056 


ESTs 


5.4 


133083 


N70633 


Hs.6456 


chaperonin containing TCP1; subunit 2 (b 


6 


133091 


AA122147 


Hs.64691 


KIAA0483 protein 


5 


133093 


AA598749 


Hs. 285996 


ESTs 


5.6 


133124 


AA1 55049 


Hs. 26792 3 


ESTs 


4.1 


133126 


D16469 


Hs.6551 


ATPase; H+ transporting; lysosomal (vacu 


6.2 


133196 


R37 367 


Hs.6727 


Ras-GTPase activating protein SH3 doma 


5.1 


133214 


Y 10659 


Hs. 285115 


interteukin 13 receptor, alpha 1 


5.2 


133225 


Z41415 


Hs.6823 


ESTs; Weakly similar to intrinsic factor-B 


8.3 


133228 


N90029 


Hs.6831 


Homo sapiens done 1400 unknown prote 


4.7 


133239 


AA059405 


Hs. 179882 


Homo sapiens done 24655 mRNA sequen 


5.5 


133240 


D31161 


Hs. 242894 


ESTs 


9 


133257 


AF005086 


Hs.6895 


actin related protein 2/3 complex; subunit 


7.7 


133264 


W72187 


Hs. 69192 


ESTs; Weakly similar to cDNA EST yk37 


5.7 


133274 




Hs 6949 


ESTs 


4.2 


133281 


AA421079 


Hs.69594 


ESTs; Weakly similar to Sox-like transcri 


4.9 


133283 


AA4 10507 


Hs.6968 


ESTs 


4,3 


133287 


L15702 


Hs. 69771 


B-factor; properdin 


9.3 


133294 


R79723 


Hs 69997 


zinc finger protein 238 


30.4 


133297 


AA600057 


Hs.70266 


KIAA0905 protein 


10.4 


133318 


AA256168 


Hs. 152316 


ESTs 


8.5 


133362 


H06195 


Hs.7194 


ESTs; Highly similar to CGI -59 protein [H 


14 


133370 


AA1 56897 


Hs. 72157 


DKFZP564I1922 protein 


5 


133391 


X57579 


Hs.727 


inhib'm; beta A (activin A; activin AB alp 


13.9 




AA491296 


Hs. 72805 


ESTs 


4,3 


133422 


N79516 


Hs 73287 


ESTs; Weakly similar to eyelid [D.melano 


4.5 


133431 


AA255438 


Hs.7358 


Homo sapiens mRMA; cDNA DKFZp566 


8 


133435 


T23983 


Hs.323966 


ESTs 


5 


133449 


AAflP4Qft9 


Hs 7181 


voltage-dependent anion channel 3 


8.7 




AUJUOO 


Hs 71<H1 


mainr hlstnrnmnatihilitv mmnlpx 1 nlass 11 

1 1 lalL/l MFOULA^Ulf ty)Q\tU\l\\J ^AJUipiGA, wtLIJw 1 1 


5 




X78710 


He: PHIPl 


mpfal-j"pnjilatnrv tranvjintion fartor 1 


5.3 


133506 


AA31B868 


Hs 74146 


ESTs; Weakly similar to 140G11.h [D.me 


6.8 


1003 1 1 


X52947 


Hs. 74471 


nan innrtinn nmtpirr alnha 1 43kD fconn 


5.7 


1 W>1 

\ OODJ 1 




Hs 278634 


KIAA0146 nrotein 


4.8 




AA3 13977 


Hs 172772 


transcription elongation factor B (SHI); po 


9.5 


133572 


W94333 


Hs 279915 


trans locase of inner mitochondrial membr 


5 


133577 


F03717 


Hs.75063 


human immunodeficiency virus type 1 enh 


7.4 


133589 


L37368 


Hs.75104 


RNA-binding protein S1; serine-rich dom 


5 


133608 


D13315 


Hs.75207 


glyoxalase I 


4,2 


133617 


AA148318 


Hs. 75249 


KIAA0069 protein 


4.5 


133627 


U09587 


Hs .75280 


glycyl-tRNA synthetase 


10 


133633 


D21262 


Hs.75337 


nucleolar phosphoprotein p130 


4.5 


133634 


U24166 


Hs.234279 


microtiibule-associated protein; RP/EB fa 


15.2 


133640 


D83004 


Hs,75355 


ubtquitin-conjugating enzyme E2N (homo 


9.1 


133644 


D89077 


Hs.75367 


Src-like-adapter 


6.4 


133649 


AA479139 


Hs 75393 


acid phosphatase 1; soluble 


4.8 


133652 


AA287383 


Hs.7540 


ESTs 


4.2 


133674 


AA458946 


Hs. 75497 


ESTs 


4.3 


133700 


K01396 


Hs.297681 


protease inhibitor 1 (anti-elastase); alpha- 


8.3 


133705 


N21648 


Hs.75659 


MpV17 transgene, murine homolog, gtom 


4.6 


133716 


Y00282 


Hs .75722 


nbophorin II 


7.5 


133720 


L27841 


Hs.75737 


pericentriolar material 1 


9.4 


133752 


U49278 


Hs.75875 


ubtquitin-conjugating enzyme E2 variant 


4.5 


133765 


D21255 


Hs.75929 


cadherin 11 (OB-cadherin; osteoblast) 


6.4 



234 



WO 02/059377 



PCT/l S02/02242 



133772 W73693 Hs.76038 isopentenyl-diphosphate deha isomerase 

133774 223090 Hs.76067 heat shock 27 kD protein 1 

133776 J03473 Hs,177766 ADP-nbosyttransferase (NAD+; poiy (AD 

133784 AA214305 Hs.301064 ESTs 

5 133814 M33882 Hs.76391 myxovirus (influenza) resistance 1 ; homo) 

133829 AA453783 Hs 76550 Homo sapiens mRN A; cDN A DKFZp564 

133834 AA147510 Hs 288660 serine protease; umbilical endothelium 

133839 M59815 Hs 170250 complement component 4A 

133842 U73477 Hs 285013 putative human HU class II associated p 

10 133845 T68510 Hs 76704 ESTs 

133859 U86782 Hs. 178761 26S proteasome-assoc'iated parJ1 homoiog 

133867 043948 Hs 76989 KIAA0097 gene product 

133868 U58090 Hs. 183874 cull in 4 A 
133871 AA454597 Hs 182793 ESTs 

15 133893 X01060 Hs 77356 transferrin receptor (p90; CD71) 

133914 N32811 Hs 77542 ESTs 

133918 W72783 Hs 58382 ESTs; Weakly similar to C13F10.5 [C.ele 

133944 AA045870 Hs 7780 Homo sapiens mRNA; cONA DKFZp564 

133946 AA156565 Hs.173878 4-nitrophenylphosphatase domain and non 

20 133963 L34587 Hs 184693 transcription elongation factor B (Sill); po 

133980 D00760 Hs 250811 proteasome (prosome; macropain) subunit 

133990 C02374 Hs 7822 Homo sapiens mRNA; cDNA DKFZp564 

133999 M28213 Hs 78305 RAB2; member RAS oncogene family 

134030 J03077 Hs 78575 prosaposin (variant Gaucher disease and v 

25 134032 Z81326 Hs 78589 protease inhibitor 12 (neuroserpin) 

134045 S82470 Hs 78768 BB1 

134046 D28473 Hs 172801 isoieucine-tRNA synthetase 
134064 D87685 Hs. 78893 KIAA0244 protein 
134070 H98621 Hs 78946 cullin 3 

30 134087 U51166 Hs 173824 thymi'ne-DNAglycosytase 

134090 M22382 Hs 79037 heat shock 60 kD protein 1 (eta peronin) 

134098 X06323 Hs 79086 ribosomal protein; mitochondrial; L3 

134110 U41060 Hs 79136 UV-1 protein; estrogen regulated 

134132 U32519 Hs 220689 Ras-GTPase-activating protein SH3-doma 

35 134168 AA398908 Hs 181634 Human Chromosome 16 BAC clone CIT9 

134170 M63138 Hs.79572 cathepsin D (lysosomal aspartyl protease) 

134208 U88871 Hs 79993 peroxisomal biogenesis factor 7 

134258 L28010 Hs 808 heterogeneous nuclear ribonucleopratein F 

134288 AA4300O8 Hs8117 ESTs 

40 134310 AA313414 Hs 8148 Homo sapiens clone 24856 mRNA sequen 

134326 U16306 Hs 81800 chondroitin sulfate proteoglycan 2 (versic 

134329 D38551 Hs,81848 RAD21 (S. pombe) homobg 

134331 AA452020 Hs 111222 ESTs; Weakly similar to CGI- 128 protein 

134351 R82074 Hs.82109 syndecan 1 

45 134357 L43575 Hs.82171 Human clone 191 B7 placenta expressed m 

134363 M37033 Hs.82212 CD53 antigen 

134367 X54199 Hs.82285 phc«prwribosylglycinamicteforrTiyltransfe 

134374 D62633 Hs.8236 ESTs 

134375 AA412720 Hs.82389 ESTs; Highly similar to CGI-1 18 protein 
50 134376 X02874 Hs.82396 2' ;5'-oligoadeny late synthetase 1 

134381 U56637 Hs.184270 capping protein (actin filament) muscle Z- 

134388 M15841 Hs.82575 small nuclear ribonudeoprotein polypepti 

134395 L09717 Hs.8262 lysosomal -associated membrane protein 2 

134399 H99801 Hs.82689 tumor rejection antigen (gp96) 1 

55 134401 AA243746 Hs.211577 kinectin 1 (kinesin receptor) 

134405 J04177 Hs.82772 collagen; type XI; alpha 1 

134415 AA329274 Hs,82911 protein tyrosine phosphatase type IVA; m 

134417 D87969 Hs.82921 solute carrier family 35 (CMP -sialic acid t 

134419 L08044 Hs.82961 trefoil factor 3 (intestinal) 

60 134421 AA122386 Hs.82985 collagen; type V; alpha 2 

134423 W96151 Hs.83006 ESTs; Highly similar to CGI-139 protein 
134438 AA449984 Hs.246857 ESTs; Highly Sfmter to proteine kinase JN 

134446 T25732 Hs.83419 KIAA0252 protein 

134453 X70683 Hs.83484 SRY (sex determining region Y)-box 4 

65 134470 X54942 Hs,83758 CX28 protein kinase 2 

134487 R38185 Hs.83954 Homo sapiens unknown mRNA 

134495 D63477 Hs.84087 KIAA01 43 protein 
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134498 M63180 Hs.84131 threonyl-IRN A synthetase 6,1 

134506 U45328 Hs.84285 ubiqu'rtin-conjugating enzyme E2I (homol 4.6 

134529 H24460 Hs.848 FK506- binding protein 4 (59kD) 6.2 

134570 U66615 Hs.172280 SWl/SNF related; matrix associated; actin 4.8 

5 134582 M234966 Hs.86041 CGG triplet repeat binding protein 1 4.7 

134600 R68B84 Hs.86347 ESTs; Weakly similar to predicted using G 5.8 

134623 X74496 Hs.86978 prolyl endopeptidase 4.5 

134654 W23625 Hs.8739 ESTs; Weakly similar to ORF YGR200c [ 13.7 

134655 AA454070 Hs.1 23090 ESTs 5.8 
10 134675 AA250745 Hs.87773 protein kinase; cAMP-dependent; catalyti 8.9 

134711 X04011 Hs,88974 cytochrome b-245, beta polypeptide (chro 6.8 

134714 U89922 Hs.890 lymphotoxin beta (TNF superfamily; mem 35.7 

134722 W47183 Hs.284226 ESTs; Weakly similar to neural F box pro 8.1 

134776 J05582 Hs,89603 mucin 1; transmembrane 6.2 

15 134806 Z49099 Hs.89718 spermine synthase 4.2 

134810 M27394 Hs.89751 membrane-spanning 4-domains; subfamily 7 

134840 U51477 Hs.89981 d'lacytglycerol kinase; zeta (104kD) 4.1 

134843 H60595 Hs 90061 progesterone binding protein 4.7 

134853 D82348 Hs 90280 5-aminoimidazole-4-carboxamide ribonuc 10.2 

20 134866 U84011 Hs 904 amylo-1;6-glucosidase; 4-alpha-glucanotr 12,1 

134868 Z39762 Hs 90419 KIAA0882 protein 5 

134885 N27670 Hs 9071 progesterone membrane binding protein 5 

134982 N46086 Hs 92308 ESTs 4.1 

134989 AA236324 Hs 92381 Homo sapiens mRNA; chromosome 1 spe 16.8 

25 134992 H05625 Hs.5831 ESTs 4 

134993 AA282343 Hs 301005 purine-rich element binding protein B 4.4 

135010 D59675 Hs 92927 ESTs 7 

135015 U54999 Hs.278338 LGN protein 4.8 

135029 AA224180 ESTs; Moderately similar to 17-beta-hydr 13.6 

30 135032 AA243497 Hs.173685 Human DNA sequence from clone 30M3 4 

135037 U77948 Hs 278589 general transcription factor II; i 8 

135059 AA598449 Hs 93832 Homo sapiens clone 24483 unknown mRN 5.4 

135071 L08069 Hs.94 heat shock protein, DNA J-l ike 2 9.3 

135083 AA495950 Hs 94262 ESTs 6.7 

35 135117 W52493 Hs 94694 Homo sapiens clone 24837 mRNA sequen 10.2 

135144 AA044842 Hs 95260 Homo sapiens mRNA; cON A DKFZp586 6.6 

135154 AA126433 Hs.267812 sorting nexin 4 7.4 

135218 D31157 Hs 324277 ESTs; Weakly similar to growth factor-res 6.2 

135237 AA454930 Hs 9691 ESTs 19.5 

40 135243 AA215333 Hs.97101 putative G protein -coupled receptor 8.8 

135335 H20989 Hs 198281 pyruvate kinase; muscle 12.4 

135349 D83174 Hs 9930 coll age n-binding protein 2 (coll ig en 2) 5.5 

135367 AA480109 Hs9963 TYRO pre te in tyrosine kinase binding pro 5.4 

135389 U05237 Hs 99872 fetal Alzheimer antigen 7.8 

45 135400 M23263 Hs 99915 androgen receptor (dihydrotesbste rone re 9.1 

135411 L10333 Hs 99947 reticulon 1 5.3 

300019 M97935 AFFX control: STAT 1 8.3 

300021 M97935 AFFX control; STAT 1 7 

300022 M97935 AFFX control: STAT 1 14 
50 300089 AI199738 Hs 208275 ESTs; Weakly similar to !!!! ALU CLASS 9.1 

300107 AI694585 Hs 270464 ESTs; Weakly similar to !!!! ALU CLASS 7.4 

300254 AW079607 Hs 188417 ESTs; Weakly similar to ZnT-3 [H.sapien 30.1 

300328 AW015860 Hs 224623 ESTs 119 

300549 AA699328 Hs 298119 ESTs 5.5 

55 300711 AI492179 Hs 166244 ESTs; Weakly similar to cDN A EST yk40 11 

300921 AW293224 Hs 232165 ESTs 11 

301 124 T79326 Hs 298262 ESTs; Weakly similar to dJ88J8.1 [H.sapi 8.8 

301165 N85789 Hs. 150186 ESTs; Weakly similar to PTE R1N4-ALP 6 

301576 AI682905 Hs.270431 ESTs; Weakly similar to !!!! ALU SUB FA 4.7 

60 301604 AA373124 Hs.24809 ESTs; Weakly similar to C17G10.1 [Cete 8 

301704 AA526313 Hs.293691 ESTs 4.2 

301782 N99399 Hs.143046 EST cluster (not in UniGene) with exon h 18 

301884 AA312082 Hs. 105445 GDNF family receptor alpha 1 20.7 

301936 NfvL004694Hs.1 14924 EST duster (not in UniGene) with exon h 11.6 

65 302002 AF013956 Hs.5637 chromobox homolog 4 (Drosophila Pc cla 9.2 

302032 NM_001992Hs. 128087 EST cluster (not in UniGene) with exon h 4.3 

302067 H05698 Hs.222399 ESTs; Weakly similar to proteirvtyrosine 7.8 
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302145 NM 003613HS.151407 
302236 AI128606 Hs.6557 
302276 NH_Q04448Hs.323910 
302290 AL117607 Hs 175563 
5 302326 NN1_004271Hs.184018 
302342 AB023141 Hs. 190386 
302372 AL117406 Hs 200102 
302422 AB021227 Hs.3743 
302431 AF1 29530 Hs.226434 

10 302501 AF022726 Hs.251446 
302505 ALD49650 Hs.247874 
302533 L36149 Hs.248116 
302638 AA463798 Hs.102696 
302656 AW293005 Hs 70704 

15 302792 AA343696 Hs .46821 
302820 X04588 Hs.85844 
302838 U66049 Hs.82171 
302892 N58545 Hs.42346 
302977 AW263124 Hs.315111 

20 302989 N45406 Hs.84700 
303007 AA478876 Hs.317714 
303052 AF140242 Hs.279926 

303131 AW081061 Hs.103180 

303132 A!929819 Hs.4055 
25 303153 U09759 Hs.246857 

303387 AA908797 Hs.180799 
303499 AI815990 Hs.293515 
303502 AA488528 
303576 T07216 Hs.301226 

30 303620 AA397546 Hs.1 19151 
303634 A1953377 Hs.28444 
303642 AW299459 Hs.1 11977 
303654 AA436942 Hs.288529 
303733 AW502498 Hs.1 5220 

35 303780 AM 24014 Hs.1 8995 
303792 C75094 Hs.199839 
303842 AI337304 Hs.126268 
303951 AW475081 Hs.1 72928 
304465 AA421948 

40 304507 AA456426 
304591 AA505702 
304601 AA507875 
304659 AA533185 
305040 AA630582 Hs. 169476 

45 305134 AA653159 Hs.179661 
305415 AA725116 Hs.78465 
305453 AA738110 
305898 AA872838 
305913 AA876109 

50 305950 AA884479 

306004 AA889992 Hs.2186 
306009 AA894560 Hs.283370 
306060 AA906161 Hs.76277 
306398 AA970548 Hs.297681 

55 306505 AA987722 Hs.172928 
306576 AA995761 Hs.276092 
307117 AI184111 Hs.76067 
307138 AI185516 Hs.172928 
307187 AI190870 Hs.276417 

60 307542 A1280659 Hs.62954 
307554 A1281603 Hs.172928 
307806 AI351739 Hs.276726 
308079 AI472733 Hs.270208 
308307 A1581398 Hs.172928 

65 308511 AI687580 Hs.1 69476 
308615 AI738593 Hs.101774 
308677 AI761173 



EST duster (not in UniGene) with exon h 
zinc finger protein 161 
EST cluster (not in UniGene) with exon h 
Homo sapiens mRNA; cDNA DKFZp564 
EST cluster (not in UniGene) with exon h 
KIAA0924 protein 

Homo sapiens mRNA; cDNA DKFZp434 
matrix metaltoproteinase 24 (membrane-in 
EST cluster (not in UniGene) with exon h 
EST duster (not in UniGene) with exon h 
multiple UniGene matches 
chemokine (C motif) XC receptor 1 
ESTs; Weakly similar to C11D2.4 [C.eleg 
ESTs 

ESTs; Weakly similar to putative [H.sapie 

EST cluster (not in UniGene) with exon h 

EST cluster (not in UniGene) with exon h 

hlstone deacetylase 3 

EST cluster (not in UniGene) with exon h 

EST cluster (not in UniGene) with exon h 

pallid (mouse) homolog; pallidin 

EST cluster (not in UniGene) with exon h 

actin-like 6 

ESTs 

mitogen-activated protein kinase 9 

ESTs 

ESTs 

EST cluster (not in UniGene) with exon h 
EST cluster (not in UniGene) with exon h 
ESTs 

ESTs; Weakly similar to predicted using G 
EST cluster (not in UnjGene) with exon h 
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EST singleton (not in UniGene) with exon 
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heat shock 27kD protein 1 
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ESTs 
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308852 AI829848 Hs.182937 peptidylprolyl isomerase A (cyclophilin A 5.9 

308974 AI872290 Hs. 300697 immunoglobulin gamma 3 (Gm marker) 4.5 

308981 AI873242 EST singleton (not In UniGene) with exon 7.6 

308995 AI880172 EST singleton (not in UniGene) with ex on 6.6 

5 309177 AI95111B EST singleton (not in UniGene) with exon 243 

309186 AI952723 Hs.90207 EST singleton (not in UniGene) with exon 61 

309198 AI955915 major histocompatibility complex; ciass I; 5 6 

309226 AI969897 EST singleton (not in UniGene) with exon 6 2 

309279 AI990102 EST singleton (not in UniGene) with exon 7 9 

10 309583 AW170035 EST 64 5 

309624 AW191929 Hs 252989 EST 5 3 

309629 AW192764 Hs 1 7292Q collagen; type I; alpha 1 6 9 

309641 AW194230 Hs. 253100 EST 11.4 

309698 AW238461 Hs 73742 ribosornal pratein; large; P0 4 3 

1 5 309700 AW241 170 Hs 179661 Homo sapiens done 24703 beta-tubulin m 119 

310073 AI335004 Hs 148558 ESTs 4 2 

310094 AW450967 Hs 235240 ESTs 5 7 

310373 AW080778 Hs 145582 ESTs 4 8 

310438 AW022192 Hs 200197 ESTs 391 

20 310470 AI281848 Hs 194691 ESTs 4 9 

310583 AW205632 Hs.211198 ESTs 7 

310877 T47784 Hs 188955 ESTs 4 1 

311067 AI587332 Hs 209115 ESTs 11.2 

311166 AI821294 Hs 118599 ESTs 241 

25 311199 T57896 Hs 191095 EST cluster (not in UniGene) 57 

311465 AI758660 Hs 205132 ESTs 15 7 

311587 AI828254 Hs 271019 ESTs 6 4 

311774 AA700870 Hs 14304 ESTs 6.2 

311785 AI056769 Hs 133512 ESTs 5 

30 311923 T60843 Hs 189679 ESTs 5.9 

311935 AA216387 EST cluster (not in UniGene) 5 5 

311972 N51511 Hs 188449 ESTs 5.2 

312014 AI435650 Hs 128778 ESTs 4 3 

312047 AA588275 Hs.180669 ESTs 14.7 

35 312147 T89855 Hs 195648 EST cluster (not in UniGene) 9 8 

312153 AA759250 Hs 153028 cytochrome b-561 27.1 

312168 T92251 Hs 198882 ESTs 4 2 

312172 AI222168 Hs. 191 168 ESTs 6 1 

312226 AI796815 Hs 199993 ESTs; Weakly similar to ubiquitous TP R 5 5 

40 312292 AW451893 Hs 151124 ESTs 184 

312312 AI080505 Hs 134529 ESTs 119 

312369 AA582039 Hs 173884 Homo sapiens mRNA; chromosome 1 spe 4 

312407 R46180 Hs 153485 ESTs 13 6 

312430 AW1391 17 Hs 117494 ESTs 4.1 

45 312470 AW451 347 Hs 175862 ESTs 4 6 

312483 AI417526 Hs7753 ESTs 15 3 

312521 AA033609 Hs 319093 ESTs 12 5 

312544 AI498371 Hs 183526 ESTs 14 6 

312638 AW4391 95 Hs 256880 ESTs 5 3 

50 312754 R99834 Hs 250383 ESTs 8 4 

312772 H63791 EST cluster (not in UniGene) 4 3 

312821 AA699325 Hs 269880 ESTs 8 3 

312837 AW292286 Hs 255058 ESTs 71 

312849 AA846353 Hs.194054 ESTs 5 9 

55 312854 AA828713 Hs 321058 EST duster (not in UniGene) 41 

312992 AA088446 Hs 170298 ESTs 7 3 

313096 AI422367 Hs 163533 ESTs 6.1 

313112 AA732534 Hs 269099 ESTs 4.2 

313126 AA720887 Hs 283313 EST duster (not in UniGene) 181 

60 313136 N59284 Hs.288010 ESTs 17 

313197 A1738851 Hs.222487 ESTs 129 

313219 N74924 Hs.182099 ESTs 7.1 

313258 AW068358 Hs.183918 ESTs 13.7 

313328 AW449211 Hs.105445 ESTs 27.9 

65 313352 AW292127 Hs.144758 ESTs 9.8 

313417 AA741151 Hs.137323 ESTs 8.2 

313455 AW081702 Hs.98571 ESTs 6.9 
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313590 AA804410 Hs.291677 EST cluster (not in UniGene) 5.3 

313663 A1953261 Hs.169813 ESTs 7.6 

313667 U69201 Hs.13684 ESTs; Weakly similar to choline kinase is 12.5 

313749 AW450376 Hs.119004 ESTs 5.5 

5 313832 AW271022 Hs, 133294 ESTs 4.3 

313881 AA535580 Hs. 16331 ESTs 7.7 

313915 A1969390 Hs, 163443 ESTs 27.1 

313955 AJ858884 Hs.270647 ESTs 5,7 

313974 AI310151 Hs.173524 ESTs 4.3 

10 314097 AA648744 Hs.269493 ESTs 14.5 

314129 AA228366 Hs. 115122 ESTs 9.5 

314359 AA205569 Hs.194193 ESTs 5.4 

314384 AA535840 Hs.162203 ESTs; Weakly similar to alternatively spli 5.3 

314394 AI380563 Hs.130816 ESTs 13.2 

15 314462 AA347951 Hs.326413 ESTs 6.2 

314465 AA602917 Hs.156974 ESTs 18.1 

314470 AI934422 Hs.30661 ESTs 4.2 

314488 AA358265 Hs. 182890 ESTs 6.1 

314506 AA833655 Hs 206868 ESTs 27.8 

20 314510 AI204418 Hs.190080 ESTs 9.5 

314558 AI873274 Hs190721 ESTs 22,5 

314661 AA436432 Hs.324239 EST cluster (not in UniGene) 13.3 

314691 AW207206 Hs.136319 ESTs 21.4 

314754 AW026761 Hs 134374 ESTs 4 4 

25 314775 AI149880 Hs.188809 ESTs 4.4 

314943 AI476797 Hs.184572 cell division cycle 2; G1 to S and G2 to M 18.4 

314961 AW0O8061 Hs.231994 ESTs 10.2 

314963 AJ689617 Hs.200934 ESTs 5.3 

315006 AI538613 Hs.298241 ESTs 20.7 

30 315010 AA531082 Hs 240049 ESTs 5 

315019 AA532807 Hs 105822 ESTs 61 

315033 AI493046 Hs.146133 ESTs 12 

315036 AA534953 Hs 163297 ESTs 8 3 

315037 AW205863 Hs 133988 ESTs; Weakly simitar to gene MAC25 pr 6 1 
35 315051 AW292425 Hs 163484 EST 12.7 

315054 AI968598 Hs.7876fl ESTs 7.6 

315073 AW452948 Hs 257631 ESTs 13.9 

315080 AA744550 Hs 136345 ESTs 4 4 

315083 AI221325 Hs.205442 ESTs 5.1 

40 315088 AA557351 Hs.152448 ESTs; Moderately similar to MULTIF UN 4.7 

315175 AI025842 Hs 152530 ESTs 11.9 

315196 AA972756 Hs.44898 ESTs 28,8 

315296 AA876905 Hs.125286 ESTs 16.1 

315303 AW194364 Hs 128022 ESTs; Weakly similar to FIG-1 PROTEIN 25.7 

45 315352 AA604799 Hs.136528 ESTs; Moderately similar to I!!! ALU SU 12.3 

315364 AA643502 Hs 155485 ESTs; Highly similar to serine protease [H 4.6 

315368 AW291563 Hs 104696 ESTs 4.8 

315390 AI801565 Hs 200113 ESTs; Weakly similar to alternatively spli 4.4 

315408 AW273261 Hs 216292 ESTs 5 

50 315458 AA872000 Hs.116104 ESTs 7.6 

315472 AA828B50 Hs 155469 ESTs 4.9 

315478 AA665612 Hs120874 ESTs 5.2 

315498 AA628539 Hs.1 16252 ESTs; Moderately similar to I!!! ALU SU 4.8 

315527 AI791138 Hs. 116768 ESTs 4 4 

55 315530 AI200852 Hs 127780 ESTs 22.4 

315562 AA737415 Hs 152826 ESTs 5 9 

315634 AA837085 Hs 220585 ESTs 8 8 

315647 AA648983 Hs 212911 ESTs 15 

315652 A!521489 Hs 3053 ESTs 6 3 

60 315676 AW002565 Hs 124660 ESTs 9 2 

315680 AAB14309 Hs 123583 ESTs 81 

315735 AI831760 Hs.155111 ESTs 13.4 

315741 AA812168 Hs 122559 ESTs 5.4 

315769 AA744875 Hs.189413 ESTs 4.4 

65 315978 AA830893 Hs.119769 ESTs 10.4 

315984 AI015862 Hs.131793 ESTs 5 

316042 AW297979 Hs.170698 ESTs 14.7 
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316136 AA830808 Hs.124366 ESTs 4 

316177 AI908272 Hs.293102 EST cluster (not in UniGene) 32.6 

316313 AA741300 Hs.202599 ESTs 4.8 

316405 AA757900 Hs.270823 ESTs 4.8 

5 316480 AJ749921 Hs.205377 ESTs 12.9 

316564 AI743571 Hs.168799 ESTs; Weakly similar to !!!! ALU SUBFA 8.1 

316714 AA809792 Hs.123307 ESTs 5 

316715 AI440266 Hs.1 70673 ESTs 4.2 
316828 AA828116 Hs.173076 ESTs 5.2 

10 316869 AI954880 Hs.134604 ESTs 13.3 

316905 AW138241 Hs.210846 ESTs 6.2 

316943 AW014875 Hs.137007 ESTs 5.3 

316949 AA856749 Hs 124620 ESTs 7.2 

317008 AW051597 Hs 143707 ESTs 4.1 

15 317028 AA962623 Hs.189144 ESTs; Weakly similar to RENAL SODIU 4.2 

317067 AI805392 Hs.325335 ESTs 4.5 

317069 AI732892 Hs 190489 ESTs 6 4 

317210 AA490718 EST cluster (not in UniGene) 4 4 

317298 A1922374 Hs 158549 ESTs 5 9 

20 317658 AW139077 Hs 202217 ESTs 4 6 

317674 AW294909 Hs 132208 ESTs 5 2 

317685 AI798630 Hs 149997 ESTs 4 3 

317836 AA983913 Hs 128929 ESTs 12.4 

317881 AI827248 Hs 224398 ESTs 12.1 

25 317902 AI828602 Hs.211265 ESTs 8 8 

317916 AI565071 Hs.159983 ESTs 12.6 

318042 AW294522 Hs 149991 ESTs 5 6 

318053 AI074465 Hs 133469 ESTs 4 

318064 AW296888 Hs.170939 ESTs 5 2 

30 318070 AJ024594 Hs 248942 ESTs 4 7 

318073 AW167087 Hs 131562 ESTs 15 7 

318146 AKM0125 Hs 150521 ESTs 5 9 

318186 AW016773 Hs3709 ESTs 5 3 

318481 AI291584 Hs 145921 ESTs; Weakly similar to HYPOTHETtCA 7 6 

35 318566 AI335361 Hs 226376 ESTs 5 8 

318617 AW247252 Hs.75514 nucleoside phosphorylase 11.1 

318662 AI285898 Hs.294014 ESTs 16 3 

318691 AW192139 Hs 181307 H3 histone; family 3A 4 

318740 NM 002543Hs. 77729 EST cluster (not in UniGene) 213 

40 318744 A17931 24 Hs 144479 ESTs 35 

318948 AA317274 Hs 13996 ESTs 117 

319163 F15257 Hs 27 glycine dehydrogenase (decarboxylating; 7 

319478 R06841 Hs 270307 EST cluster (not in UniGene) 89 

319545 R83716 Hs 14355 ESTs 8 2 

45 319668 NM_002731Hs 87773 EST cluster (not in UniGene) 25 4 

319763 AA460775 Hs6295 ESTs 7 

319913 AA179304 Hs 271586 ESTs; Moderately similar to !!!! ALU SU 8 7 

319936 W22152 Hs 282929 EST cluster (not in UniGene) 5 6 

319951 AA307665 Hs 14559 ESTs 4 9 

50 319962 H0635O Hs 135056 ESTs 9 2 

319977 AA632632 EST cluster (not in UniGene) 4 6 

320074 AA321166 Hs 278233 EST cluster (not in UniGene) 16 7 

320092 AF022799 Hs 113292 calpain 9 (nCL^i) 5 4 

320107 AA836461 Hs 291712 EST cluster (not in UniGefle) 5 3 

55 320133 D63271 EST cluster (not in UniGene) 5 5 

320167 AA984373 Hs.90790 EST cluster (not in UniGene) 15 

320187 T99949 Hs.303428 EST cluster (not in UniGene) 6 7 

320211 AL039402 Hs.125783 DEME-6 protein 243 

320401 U90449 Hs.152717 nucleoside diphosphate kinase type 6 (inh 10 

60 320458 AI884396 Hs.24131 ESTs 5 4 

320488 R31386 Hs.191791 EST cluster (not in UniGene) 49 

320521 N31464 Hs.24743 ESTs 9 5 

320661 AA864846 Hs.1 15175 EST cluster (not in UniGene) 6 6 

320691 R61576 Hs.313951 hypothetical protein 5 9 

6 5 320699 R63161 Hs.118249 EST cluster (not in UniGene) 4 

320727 U96044 Hs.181125 EST cluster (not in UniGene) 15.3 

320993 AL050145 Hs.225986 Homo sapiens mRNA; cDIMA DKFZp586 7 2 
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321012 AA737314 Hs.194324 EST cluster (not in UniGene) 6.1 

321050 AW393497 EST cluster (not in UniGene) 5 

321051 AF134149 Hs.240395 EST cluster (not in UniGene) 11.4 
321171 AI769410 Hs.221461 ESTs 7.7 

5 321192 AA295304 Hs.297939 ESTs; Weakly similar to neogen in [H. sap 5.5 

321354 AA078493 EST cluster (not in UniGene) 16 9 

321387 H68014 Hs.141278 ESTs; Weakly similar to !!!! ALU SUB FA 4 2 

321412 AW366305 Hs.22891 EST cluster (not in UniGene) 6 3 

321489 AW392474 Hs.172759 ESTs; Moderately similar to !!!! ALU SU 9 

10 321539 N98619 Hs.42915 ARP2 (actin-related protein 2; yeast) horn 11.3 

321593 H84762 Hs.253197 ESTs 10,4 

321666 D28390 Hs.272897 EST cluster (not In UniGene) 19 9 

321891 AW157424 Hs.165954 ESTs 5 6 

321910 H67065 Hs.271530 ESTs; Weakly similar to !!!! ALU SUBFA 5 4 

1 5 321953 AW068268 Hs.292833 ESTs; Weakly similar to !!!! ALU CWSS 65 

321978 N77342 Hs.21851 EST duster (not in UniGene) 102 

322017 AA310O39 Hs.9192 ESTs 9.8 

322026 AA233527 Hs 283675 low density lipoprotein receptor (familial 27 8 

322035 AL137517 Hs,306201 EST cluster (not in UniGene) 40 2 

20 322171 AF085968 Hs. 48474 EST cluster (not in UniGene) 5 7 

322175 AF085975 EST cluster (not in UmGene) 7 7 

322236 AL134970 Hs.104222 fo His latin-like 1 14 4 

322303 W07459 Hs 157601 EST cluster (not in UniGene) 13 4 

322735 AA086123 Hs 297856 EST cluster (not in UniGene) 7 6 

25 322777 AA679082 Hs. 269947 ESTs 4 4 

322818 AW043782 Hs 293616 ESTs 21 

322882 AW248508 Hs. 279727 DiGeorge syndrome critical region gene 2 15 3 

322975 C16391 EST cluster (not in UniGene) 213 

322991 C18965 Hs 159473 ESTs 117 

30 323011 AA580288 EST cluster (not in UniGene) 8 9 

323091 AW014094 Hs 210761 ESTs 10 8 

323107 AI301107 Hs,150790 ESTs 6 5 

323136 AL120351 Hs. 30177 EST cluster (not in UniGene) 55 

323168 AL120862 Hs 124165 ESTs 17 9 

35 323195 AI064982 Hs.1 17950 multifunctional polypeptide similar to SA 5 8 

323201 AL049370 Hs 13350 Homo sapiens mRNA; cDMA DKFZp586 11.6 

323203 AA203135 Hs. 130186 ESTs 6 4 

323243 W44372 Hs. 110771 EST cluster (not in UruGene) 7 3 

323244 T70731 Hs.193620 EST cluster (not in UniGene) 15 8 
40 323328 AA228078 Hs 255096 EST cluster (not in UniGene) 4 8 

323332 AI829520 Hs.227513 ESTs 20.2 

323333 AA228883 Hs 208558 EST cluster (not in UniGene) 8 8 
323570 AL038623 Hs 208752 ESTs; Weakly similar to PI! ALU SUBFA 5 
323604 AI751438 Hs 41271 ESTs; Weakly similar to ALU SUBFA 6 5 

45 323685 AA344205 Hs 289088 EST cluster (not 'in UniGene) 71 

323753 AA327102 Hs.70266 EST cluster (not in UniGene) 61 

323817 AA410943 EST cluster (not in UniGene) 16.8 

323845 AI684674 Hs.41127 ESTs; Weakly similar to waclaw [D.rneJan 10.1 

323930 AA570698 Hs 8173 ESTs 6.4 

5 0 323997 AAB44907 Hs ,274454 EST cluster (not in UniGene) 8 

324047 AA378201 Hs.271340 EST cluster (not in UniGene) 6.3 

324261 AL044891 Hs.269350 EST cluster (not in UniGene) 50.1 

324302 AA543008 Hs.292471 ESTs; Weakly similar to ! M.I ALU SUBFA 5 7 

324338 AL138357 Hs.145078 ESTs 9 5 

5 5 324344 A W502000 Hs. 46677 EST duster (not in UniGene) 44 

324432 AA464510 Hs.152812 EST cluster (not in Un;Gene) 16.7 

324495 AW501411 Hs.122489 ESTs; Weakly similar to I'll ALU CLASS 5.5 

324497 AW152524 Hs.136340 ESTs 5 4 

324598 AA502659 Hs.163986 ESTs 6 8 

60 324603 AW016378 Hs.292934 ESTs 23.1 

324620 AA448021 Hs.94109 EST cluster (not in UniGene) 21.2 

324727 AI610425 Hs.19597 ESTs 5 

324774 AI031771 Hs 132586 ESTs 5 

324783 AA640770 Hs , 200994 EST cluster (not in UniGene) 4 1 

6 5 324824 AI826999 Hs.224624 ESTs 6.3 

324826 AA704806 Hs.143842 ESTs 11.7 

324902 D31323 Hs.271492 ESTs 48 
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Hs.17385 


ESTs 


4.7 


331222 


T98531 


Hs.173904 


ESTs 


4.1 


331230 


W69807 


Hs.16537 


hypothetical protein; similar to (U06944) 


4.9 


331306 


AA252079 


Hs.63931 


dachshund (Drosophila) homotog 


15.1 


331327 


AA281076 


Hs.109221 


ESTs 


4 8 


331337 


AA287662 


Hs.50495 


ESTs 


7.6 


331341 


AA303125 


Hs.23240 


ESTs; Weakly similar to !!!! ALU SUBFA 


13 


331344 


AA357927 


Hs.126550 


ESTs 


12.4 


331362 


AA417956 


Hs.40782 


ESTs 


6.5 


331363 


AA421562 


Hs.91011 


anterior gradient 2 (Xenepus laevis) homo 


28.2 


331376 


AA443802 


Hs,41007 


ESTs; Weakly similar to cDNA EST yk47 


15.1 


331384 


AA4 56001 


Hs.93847 


ESTs 


7.9 
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331478 


N26608 


Hs.40639 


ESTs 


7 


331526 


N49967 


Hs.46624 


ESTs 


1Q ft 
13,0 


331533 


N51517 


Hs.47282 


ESTs 


Q.J 


331661 


W85712 


Hs. 119571 


cotlagen; type III; alpha 1 (Ehters-Danlos 


13 ft 


331686 


W88502 


Hs. 182258 


ESTs 


Q Q 


331750 


AA284372 


Hs. 111471 


ESTs 


c; c 

J.O 


331751 


AA284840 


Hs.143818 


ESTs 


>; ft 

3.0 


331760 


AA292721 


Hs. 154434 


ESTs; Weakly similar to unknown [H.sap 


7 A 


331763 


AA312861 


Hs. 96704 


ESTs 


( .0 


331825 


AA411144 


Hs.292882 


ESTs 


152 


331890 


AA432166 


Hs.3577 


succinate dehydrogenase complex; subuni 




331952 


AA454756 


Hs.97837 


ESTs 


5 


332015 


AA487910 


Hs.2O880O 


ESTs; Weakly similar to !!!! ALU CLASS 


10.5 


332043 


AA490831 


Hs,125056 


ESTs 


H A 


332060 


AA504779 


Hs.191402 


ESTs 


13.6 


332071 


AA598594 


Hs. 205293 


ESTs 


9.1 


332093 


AA608794 


Hs. 112592 


ESTs 


8.8 


332139 AA620669 


Hs.1 12879 


EST 


9 


332219 


N22508 


Hs.139315 


ESTs 


7.1 


332225 


N33213 


Hs.100425 


ESTs 


12.2 


332246 


N57927 


Hs.1 20777 


ESTs; Weakly similar to RNA POLYME 


15.6 


332247 


N58172 


Hs.109370 


ESTs 


16.9 


332260 


N70088 


Hs.138467 


ESTs 


4 


332269 


M91279 


Hs.109654 


ESTs; Moderately similar to outer membr 


8.2 


332336 


T96130 


Hs.137551 


ESTs 


7.7 


332340 


W 15495 


Hs.1 29781 


chromosome 21 open reading frame 5 


14.1 


332347 


W60326 


Hs.288584 


ESTs 


4.4 


332362 


W93640 


Hs.4779 


ESTs; Moderately similar to similar to AD 


16.9 


332467 


AA489630 


Hs.1 19004 


K1AA0665 gene product 


4.8 


332499 


M12036 


Hs.323910 


Human tyrosine kinase-type receptor (HE 


10.4 


332513 


AA018182 


Hs.154424 


detodinase; lodothyronine; type II 


5.8 


332526 


AA281753 


Hs.77515 


inositol 1;4;5-triphospriate receptor; type 


19 


332532 


N63192 


Hs.1892 


EST; Highly similar to PHENYLETHAN 


15.3 


332565 


AA234896 


Hs. 25272 


E1A binding protein p300 


12.3 


332607 


R41791 


Hs .36566 


LIM domain kinase 1 


11.1 


332640 


AA417152 


Hs.5101 


protein regulator of cytokinesis 1 


18.2 


332694 


AA262768 


Hs.243901 


KIAA1067 protein 


15.2 


332702 


H93968 


Hs.75725 


transgelin 2 


4.7 


332705 


T59161 


Hs.76293 


thymosin; beta 10 


5.5 


332749 


AA479968 


Hs.88251 


arylsuifatase A 


9.8 


332927 






CH22 FGENES.38J 


17.7 


332929 






CH22 FGENES 38_3 


4.7 


332930 






CH22.FGENES.38_4 


7.4 


332955 






CH22 FGENES.48J2 


54 


332958 






CH22 FGENES.48J5 


17.8 


332961 






CH22_FGENES.48JB 


10.6 


332983 






CH22 FGENES.54_5 


4.3 


333009 






CH22 FGENES.61J 


52 


333010 






CH22_FGENES.61_2 


8.1 


333013 






CH22 FGENES 61_5 


85 


333108 






CH22 FGENES 79J4 


5 6 


333139 






CH22 FGENES 83J6 


63 


333254 






CH22_FGENES 118,2 


68 


333305 






CH22_FGENES 137_2 


11.4 


333343 






CH22 FGENES 139 J 2 


5.1 


333388 






CH22 FGENES 144,3 


12.7 


333456 






CH22 FGENES 157,5 


4.2 


333459 






CH22_FGENES 157,8 


7.6 


333517 






CH22 FGENES 173,2 


8.2 


333585 






CH22 FGENES.203,4 


5 


333679 






CH22_FGENES 247,6 


4.3 


333743 






CH22_F GENES 264,1 


13.4 


333758 






CH22_FGENES.268_1 


4 


333767 






CH22 FGENES 271_6 


5.6 


333768 






CH22„FGENES 271^7 


12.2 


333769 






CH22 FGENES 271 8 


48.3 


333795 






CH22 FGENES 275 1 


6.1 
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333796 


CH22 FGENES.275J 


6.8 


333892 


CH22_FGENES.292J4 


4.4 


333904 


CH22 FGENES.294_2 


6.5 


333905 


CH22„FGENES.294J 


9.3 


333921 


CH22_FGENES.296J2 


9.6 


333968 


CH22_FGENES.307_4 


15.9 


334102 


CH22_FGENES.327_60 


7.1 


334222 


CH22_FGENES.360_3 


6.7 


334223 


CH22 FGENES.360_4 


33.5 


334264 


CH22 FGENES.367J5 


185 


334343 


CH22 FGENES.375_25 


6.1 


334360 


CH22 FGENES.378_5 


6.1 


334784 


CH22_FGENES.432J 


4.8 


334789 


CH22 FGENES.432 14 


5.1 


334794 


CH22_FGENES.434_2 


7 


334889 


CH22J=GENES.452_3 


12.4 


335004 


CH22_FGENES.472_8 


7.9 


335115 


CH22 FGENES.496_2 


18.8 


335287 


CHZLFGENES.526J1 


4.5 


335342 


CH22_FGENES.536_1 


5.3 


335491 


CH22 FGENES.570_23 


24 


335495 


CH22 FGENES.570_28 


7 


335498 


CH2£_FGENES.571_7 


12.2 


335544 


CH22 FGENES.576_5 


8.4 


335610 


CH22„FGENES.583J 


129 


335653 


CH2^FGENES.590_4 


6.7 


335682 


CH22_FGENES.595_2 


121 


335687 


CH22_FGENES.596 - 2 


139 


335755 


CH22_FGENES.604J 


11.5 


335782 


CH22_FGENES.609_4 


17.9 


335791 


CH22_FGENES.611_7 


27.3 


335809 


CH22_FGENES.617 6 


192 


335822 


CH22„FGENES.619_7 


19.1 


335823 


CH22_FGENES.619_8 


4.5 


335824 


CH2£JGENES.619_11 


40.2 


335825 


CH22 FGENES.619J2 


343 


335895 


CH22_FGENES.635_3 


102 


335917 


CH22_FGENES.636J3 


6 


335920 


CH22_FGENES.636J6 


8.8 


336035 


CH22 FGENES.678.6 


5.9 


336042 


CH22_FGENES.679_4 


5.8 


336093 


CH2£_FGENES.691„2 


116 


336096 


CH22_FGENES.691_5 


7.6 


336150 


CH22 FGENES.706J5 


6.3 


336152 


CH22 FGENES.706J 


10 5 


336416 


CH22„FGENES.823_38 


5 


336444 


CH22_FGENES.827_10 


4.8 


336449 


CH22 FGENES.829J 


13.6 


336471 


CH22_FGENES.829_30 


6.9 


336512 


CH22_FGENES.834_7 


214 


336558 


CH22.FGENES.842J 


8.2 


336560 


CH22 FGENES.842J 


9 


336676 


CH22 FGENES.43-4 


9.4 


336959 


CH22_FGENES.367-13 


19 


337968 


CH22 EM:AC005500.GENSCAN.103-2 


134 


338008 


CH22_EM:AC005500.GENSCAN.127-9 


152 


338057 


CH22 EM:AC005500.GENSCAN.160-1 


139 


338410 


CH22 EM:AC005500.GENSCAN.34U 


8 


338451 


CH22 EM:AC005500.GENSCAN.359-3 


116 


338588 


CH22_EMAC005500.GENSCAN.432-1 


10.3 


338665 


CH22 EMAC005500.GENSCAN.464-2 


4.8 


338689 


CH22 EM:AC005500.GENSCAN.475-3 


6.7 


338832 


CH22 DJ246D7.GENSCAN.6-9 


4.8 


338980 


CH22J)A59H18.GENSCAN.24 


5.1 


339352 


CH22 BA354I12.GENSCAN.29-7 


6.9 


339373 


CH22_BA232E17.GENSCAN.1-29 


4.3 



244 



WO 02/059377 



PCT/US02/02242 



TABLE 13A 



Table 13 A shows the accession numbers for those pkeys lacking unigenelD's for Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681 J AA602964 AA609200 

103207 30635.-4 X72790 

103349 11052.-2 X89059 

110856 19346J4 AA992380 N33063 NI21418 H79958 R21911 H79957 

113243 328626J T63857 AW971220 AA493469 T63699 

123169 44573_2 A1950087 N70208 R9704O N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 



AA908598 AA251875 AI820501 AI820532 W87891 T85904 U 71456 T82391 BE328571 T75102 R34725 AA884922 BE323517 
Ai219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW1 73586 AW150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 A 1961 530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA1800O9 AA337499 AW961 101 AA251669 AA251874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI6330O6 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 



102919 25180.2 M21 191 AL035748 AA021266 AA323126 AA180515 AI613029 D28356 NM_000034 M1 1560 AW401425 AW248248 AA012956 



AA323294 W04965 H38759 AA206622 AA580747 AI541 172 AA381075 AA354229 AW402353 AW405575 AW404021 AW406207 
AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515 AW401583 AA331367 
AW402140 AW249079 R31488 AA075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431005 AA306726 R33804 
AA216544 AW275288 AA227044 AL038124 AA243300 C03242 AA315615 AL035840 R64336 AA313917AA018963 AA001385 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 AA348257 
AW175086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640AA085425 AA481708 
AA313637 AL039229 H84490 H86153 FO0656 AA326668 AA347304 R65890 H41949 AA339309 AW402002 AW404854 
AA192582 AA112802 H09248 N83165 H38367 AA356339 AA455763 R66853 AA294935 H85911 AA310414 H93436 N87014 
AA001186 H83640 AA41 1328 AA317929 C04192 AW406288 U46335 AA323179 AA427649 AI366131 H14328 AA197161 
AA379497 AA31 1 816 AA017206 AA001 1 37 AA017420 AA012990 AW163775 AA021397 AA29551 3 AA355248 AA374921 
AW380419 AA345864 AA318058 AA37171 1 AA363255 AA057094 R88057 AA394045 AW362741 AA479579 AW362789 
AW362775 AA223624 AW362809 AW362746 AW362753 AW380412 AA057104 AA295271 X06352 AW362771 X12447 R59553 
R87925 AA292864 R83920 F01 120 AA496740 AA852551 AL039209 AA093339 AA333961 N86416 C03325 003631 AA194750 
AA359270 AA359460 AW245492 AA339400 AA3 30784 A 1 908970 N89223 AW361938 AI940063 AA 1 34323 AA 37 1741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 AKJ74079 F2911 8 AA852940 F35696 AA345963 
AA079578 AA 11 37 85 C02989 AA095945 C03145 COS 199 AA346024 AA 190506 AW361659 M909845 AW374374 AW374382 
AW374401 AW374373 AW374370 AI909831 AW374367 AA353658 F01041 C02843 AA375948 AW374414 AA213946 AI525039 
H13744 R31007 AA1 12044 AA134404 H47935 AW177018 AA429768 AA335873 AA 112875 H46393 AA191267 D59131 
AW406037AA055244AA341880AA179024AA308537AW406985AW327311 H30301 AA3O0705 H43788 AW364149 
AA806213 AA481936 C04941 AW375299 M21190 AA410818 AA25O940 AI354547 AA317422 AA250903 AI865497 AA890603 
AA366197 AW498538 M78072 AW40646 1 C03092 FOO308 H56488 AA336320 AW406501 AA354102 AA382942 AA096393 
AW376830 AA383446 F01259 AW081388 T94036 AA379643 H43842 AI524063 AA621727 AA379099 AA371417 R66811 
H44129 N84794 F01 135 AA477852 AA293062 AW361595 H27194 AI831650 H43253 H24797 AI564680 AA380090 W20057 
AI921586 AI192549 AWO908O8 H25967 AA918121 AI626060 H20221 AA81 2572 H42178 AA887222 H960C0 C03180 F00946 



123533 genbank JW608751 
116480 genbank_C14088 
132225 genbank_AA128980 
125154 genbank JV3841 9 
118475 genbank.N66845 



AA608751 
C 14088 
AA1 28980 
W38419 
N66845 
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C03986 AI313091 A 18601 72 AA582179 AI633388 AA557193 R68Q75 F24105 AW51 8239 W56 622 A1625219 AI925243 
AW458046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA1 59956 
AA610836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 AA010455 
AA563928 AI571596 AI128394 W73707 A W23575 AA583809 AA657988 AI950837 AA169782 AA600009 AI885540 AA771 884 
AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AWO709O9 AI905695 AA4801 15 AA574051 
AI889185 AA773167 AA331375 AA001437 AA194324 AA194300 AA558632 AI038538 AA41 1329 AA781570 AI833176 
AA935520 AW074197 AA583063 AW073099 AW001 198 AA948025 AA587857 AA191540 AI460085 AA193244 AI538037 
AA51 5572 AA758587 AI14931 1 AA508610 AA206409 AA5340O4 AA994600 AA827543 AI916349 AW245129 AW517804 D25663 
AA781985 AA284536 AI819422 H16040 H27531 AA456564 AA845555 AI423596 AA012908 AA889439 AA71631 1 AA968868 
AA320508 AA725731 AA834202 AA935997 AA72481 5 AA769353 AA594803 F00827 AI342442 AI003519 AKXJ2503 AI347597 
AI040946 AA197162 AA987883 AA292865 AW001944 AI64071 1 AW244044 AA456784 F30588 AA290829 H24754 AI978683 
AA483686 M583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 AA226934 AA716204 
AW513025 AA628543 AA583705 F25702 AI368748 AI124097 AI880086 AA47751 3 AI758834 A1690753 AA477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25054 AI569314 AA961665 AI922050 AI759000 AA555236 M514432 
AA293474 AA001129 AA826789 AA64 1390 AA 134405 F35585 AA477416 AW193359 AI361315 AA284988 F36340 AI361322 
F26969 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA190992 AA577139 AA865535 AA134324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007 
AI830942 AA113195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148AA857063 AA017259 
AA179789 AA088908 H43704 AA194320 F35950 AI880127 F20441 F32878 AA962483 H39094 H 56489 H44621 F19390 
AI302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 M079530H28106 AA243301 
AI025737 AA101239 AA088887 AI214910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 AI811549 AA665180 AA708200 F01124 F32382 AA346220 AA627361 F30741 F30010 F28543 AA21 1715 F20245 
AA331222 F25634 F21996 W28215 F21911 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 
AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F2841 1 
H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F31459 F18564 F33496 F16376 F29740 F29843 F29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 A1926215 AA464185 F18217 N26193 AL043256 T41 197 F33055 F00386 F29500 F34191 F33297 
AI937207 F22724 F15909 F26232 F18889 AA318627 F29085 AA872104 F17509 F23373 F15660 F17552 F17412 F16863 
F34033 F21515 F17364 F18383 F16546 F17561 F17260 AA292000 F15723 T47438 F16798 F18046 F18319 F17978 F17566 
F34230 F33258 F20860 F17998 AI695701 

118600 genbanK_N69222 N69222 

118952 genbank_N92966 N92966 

120873 genbank_AA358015 AA358015 

113702 genbank_T97307 T97307 

129982 221.267 Z14221 AW381 862 M97920 AW401 444 Z66542 M29470 AW406502 X6 1011 M34024 AA327072 Z14166Z14167 Z14165 

AW403806 214200 AA383972 Z14205 Z14201 M1 851 3 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF062115 Z47240 AF062263 AF062261 AF062223 AF062211 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z11946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L11699 AW402665 LO9070 L28049 L08O90 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 292895 AF1151 12 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJO06171 U00510U58144 AF004323AF1 15109 AF004324 AF015130 AF090414 AF09O418 AB021536 AB021519 
Z96957 AF021986Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF067420 AF052520 
AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364B60 AF001424 AF103163 Z98717 AF103143 AJO08250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF1031 15 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NMJJ02179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA7 15266 AW404328 AI831674 AI709348 AA603112 
AW514864 AA485775 H64492 AW404789 AA487630 AA71 5498 AA295885 T2761 3 T981 1 3 

115763 genbank_AA421560 AA421560 

124357 genbank_N22401 N22401 

108733 504187.1 AA1 21 022 AA1 26422 

101544 entrez_M31169 M31169 

124447 genbank„N48000 N480C0 

124677 genbank_R01073 R01O73 

124777 genbank_R4l933 R41933 

119302 genbank_T25725 T25725 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



103680 

135029 

112253 

Pkey 

322175 

323011 

322975 

317210 

323817 

309583 

324961 

303502 

320133 

311935 

321050 

319977 

312772 

321354 

336512 

336558 

336560 

329367 

329373 

336676 

338008 

338057 

329655 

336959 

329899 

329960 

338410 

338451 

338588 

338665 

338689 

308677 

338832 

338980 

333009 

333010 

333013 

308981 

308995 

333108 

333139 

333254 

333305 

333343 

333388 

326213 

333456 

333459 

333517 

333585 

333679 

326474 

333743 

333758 

333767 

333768 

333769 

333795 

333796 

335004 

333892 

335115 

333904 



entrez_Z93784Z93784 

H588l8_at H58818 

gefibank_R51818 R51818 

CAT number Accession 

4 6877J AF085975 H53458 H53459 

139750J AA580288AA315655AA1 33031 AA377748 

1510563 J C16391 C16413 

21 1994J AW881145 AA490718 M85637 AA304575 T06067 M331991 
233566.1 AA410943AW948953 AA334202 AA332882 
1046029,-2 AW1 70035 

376239J AA613792 AW182329 T053O4 AW858385 

325188J BE174240 AA488528 AL042253 

447553J BE151746 BE336853 D63271 T94955 AA774994 

174129J AA216387T63548AA228676 

502195.1 C05928AW393497 

345248.1 AA534222 AA632632 T81234 

4380.7 AW962299 AA310349 AVV962294 H63791 H63751 

1 16028.-2 AA078493 

CH22_3941FG_834_7_LINK_DJ 

CH22J992FG.842_3.LINK_DJ 

CH22_3994FG.842_5_LINK.DJ 

c_x_hs 

c_x_hs 

CH22_4154FG_43_4_ 
CH22_6490FG_LINK_EM.AC00 
CH22_6558FG_LlNK_EMACO0 
c14_p2 

CH22J764FG_367_13_ 

c15_p2 

c16_p2 

CH22_7067FG_LINK_EM AC00 
CH22_7124FG_LINK_EMAC0O 
CH22_7331FG_UNK_EMAC00 
CH22_7438FG_LINK_EMAC00 
CH22_7464FG_UNK,EMAC00 
AI761173 

CH22_7678FG_LINK_DJ246D7 

CH22_7859FG_JJNK_DA59H18 

CH22_233FGJ1J_LINK_EMA 

CH22J34FG_61_2JJNK_EMA 

CH22_237FG_61_5_LINK_EMA 

AI873242 

AI880172 

CH22J36FG_79J4_UNK_EM: 
CH22_368FG_83_16_LINK_EM: 
CH22„495FG_1 18_2_LINK_EM: 
CH22 550FG_137_2JJNK EM: 
CH22_589FGJ39J2_UNK_EM 
CH22J34FG_144_3JJNK_ENt 
c17_hs 

CH22_706FG_157_5„LINK_EM: 
CH22_709FG_157_8_LINK_ENt. 
CH22_773FGJ73_2_LINKJM: 
CH22_846FG_203_4_LINK_EM-. 
CH22J41 FG_247_6_LINK_EM: 
c19_hs 

CH22J0O9FGJ64JJJNKEM 
CH22_1024FG_268_1J_INK_EM 
CH22_1034FG_271 5 LINK EM 
CH22_1035FG_271_7_LINK_EM 
CH22_1036FG_271_8_LINK_EM 
CH22J063FG_275_t_LINK_EM 
CH22J065FGJ75 3 LINK^EM 
CH22_2326FG_472.8_LINK.EM 
CH22J163FG_292J4JJNK_E 
CH22J447FG_496_2_UNK_EM 
CH22_1176FG_294_2_UNK EM 
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333905 CH22_1177FG_294_3_UNK_EM 
333921 CH22_1194FG_296J2_LINK_E 
333968 CH22_1245FG_307JJJNK_EM 
328059 c_6_hs 
5 335287 CH22_2629FG_526_11_UNK_E 

326816 c20_hs 

326817 c20_hs 

335342 CH22 J689FG_536J_LINK_EM 
335491 CH22 2843FG_570_23_LINK_E 
10 335495 CH22J847FG_570_28_UNK_E 
335498 CH22_2850FG_571_7_LINK^EM 
328304 cjjis 
305453 AA738110 

335544 CH22J899FG_576_5J.INK_EM 
15 335610 CH22_2969FG_583_4_LINK_EM 
335653 CH22_3013FG_590JU_INK_EM 
335682 CH22_3043FG_595_2_UNK^EM 
335687 CH22_3048FG_596_2_LINK_EM 
328492 c_7_hs 
20 335755 CH22J122FG_604_4_LINK_EM 
335782 CH22_3151FG_609_4JJNK_EM 
335791 CH22_3160FG_61t_7_UNK_EM 
335809 CH22_3181FG_617_6_LINK_EM 
335822 CH22_3195FG_619_7_LINK_EM 
25 335823 CH22_3196FG_619J„LINK_EM 

335824 CH22J197FG_619J1_LINK_E 

335825 CH22_3198FG_619J2_UNK_E 
335895 CH22J272FG_635_3_LINK_EM 
335917 CH22_3294FG_636J3_LINK_E 

30 335920 CH22_3297FG_636_16_LINK_E 

305898 AA872838 

305913 AA876109 

305950 AA884479 

328857 c 7Jis 
35 330084 c19j)2 

337968 CH22_6419FG_LINK_EM:AC00 

309177 AI951118 

309198 AI955915 

309226 AI969897 
40 339352 CH22_8317FG_LINK_BA354I1 

309279 A1990102 

339373 CH22_8348FG___LINK_BA232E1 
325622 c14_hs 

334102 CH22 1380FG_327_6O_LINK_E 
45 332927 CH22L148FGJ8_1_LINK_C20H 

332929 CH22J50FG„38_3J_INK_C20H 

332930 CH22_151FGJ8J_LINK„C20H 
332955 CH22J79FG_48_12_L!NK_EM: 
332958 CH22_182FG_48_15_L!NK_EM: 

50 332961 CH22J85FG_48_18JJNK_EM; 
332983 CH22_207FG_54_5J_INK_EM:A 

334222 CH22_1506FG_360_3_L1NK_EM 

334223 CH22_1507FG_360_4_L1NK_EM 
334264 CH22_1551FG_367„15_LINK_E 

55 327110 c21_hs 

334343 CH22 1636FG_375_25 L1NK_E 

334360 CH22_1654FG_378„5„LfNK^EM 

327196 c_1_hs 

327283 c_1_hs 
60 327313 c_1_hs 

304465 AA421948 

304507 AA456426 

327450 c_2Js 

304591 AA505702 
65 304601 AA507875 

304659 AA533185 

334784 CH22_2096FG_432_9_LINHLEM 
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334789 CH22J101FG_432_14J_INK_E 

334794 CH22J106FG_434_2_LINK_EM 

336035 CH22_3420FG_678_6_LINKDJ 

336042 CH22_3427FG_679_4_LINK„DJ 

336093 CH22_3481FG_691_2_UNK_DJ 

336096 CH22_3484FG_691_5JJNK_DJ 

334889 CH22„2206FGJ52J3JJNK_EM 

336150 CH22_3540FGJ06_6JJNK_DA 

336152 CH22_3543FG_705_9_UNK_DA 

336416 CH22_3833FGJ23_38_LINK_B 

336444 CH22_3864FG_827J0_LINK_D 

336449 CH22_3870FGJ29_6J_INK_DJ 

336471 CH22_3894FG_829_3QJJNK_D 
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TABLE 13B 

Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 13B. For each predicted ex on, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham !. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489^195. 
Strand: Indicates DNA strand from which exons were predicted, 

reposition: indicates nucleotide positions of predicted exons. 



Pkey 


Ref 




Strand 


Nt_posrtion 


332955 


Dunham, 


. et.al. 


Plus 


2508896-2508992 


332958 


Dunham, 


. etal. 


Plus 


2516164-2516310 


332961 


Dunham, 


. et al 


Plus 


2521424-2521555 


333139 


Dunham, 


. et al. 


Plus 


3369495-3369571 


333254 


Dunham, 


. etal. 


Plus 


2521424-2521555 


333305 


Dunham, 


.etal. 


Plus 


4630388-4 630645 


333388 


Dunham, 


. etal. 


Plus 


4913749-4913805 


333517 


Dunham, 


.etal. 


Plus 


5570729-5570925 


333585 


Dunham, 


. et at. 


Plus 


6234778-6234894 


333679 


Dunham, 


. et al 


Plus 


7068795-7068896 


333767 


Dunham, 


. et al 


Plus 


7694407-7694623 


TV} "TOO 

33o7oo 


Dunham, 


. etal 


Plus 


7695440-7695697 


333769 


Dunham, 


.etal 


Pius 


7696625-7696707 


333795 


Dunham, 


, etal 


Plus 


7807688-7807795 


333796 


Dunham, 


. etal 


Plus 


7808253-7808319 


333892 


Dunham, 


. etal 


Plus 


8156825-8157001 


333921 


Dunham, 


.etal 


Pius 


8380325-8380441 


333968 


Dunham, 


. etal 


Pius 


86810043681241 


334102 


Dunham, 


. etal 


Pius 


9995140-9996373 


334264 


Dunham, 


. etal 


Pius 


13234447-13234544 


334343 


Dunham, 


. etal 


Plus 


13655828-13656307 


334794 


Dunham, 


. etal 


Pius 


16374312-16374458 


334889 


Dunham, 


. etal 


Plus 


19286024-19286515 


335287 


Dunham, 


. etaL 


Pius 


22299047-22299299 


335491 


Dunham, 


.etal 


Plus 


24128651-24128827 


335495 


Dunham, 


. etal 


Plus 


24140688-24140872 


335498 


Dunham, 


.etal 


Plus 


24172082-24172161 


335653 


Dunham, 


. etal 


Plus 


25329710-25329802 


.335687 


Dunham, 


. etal 


Plus 


25445952-25446064 


335809 


Dunham, 


.et.al 


Plus 


26310772-26310909 


335822 


Dunham, 


.etal 


Plus 


26364087-26364196 


335823 


Dunham, 


. et.al 


Plus 


26365925-26366004 


335824 


Dunham, 


.et.al. 


Plus 


26376860-25376942 


335825 


Dunham, 


. et.al 


Plus 


26378175-26378268 


336035 


Dunham, 


.etal. 


Plus 


29016748-29017410 


336093 


Dunham, 


. et.al 


Plus 


29556922-29557002 


336096 


Dunham, 


. etal. 


Plus 


29578878-29579047 


336444 


Dunham, 


. etal 


Plus 


34190585-34190718 


336959 


Dunham, 


. et.al 


Plus 


13233040-13233126 


338008 


Dunham, 


. et.al 


Plus 


7697068-7697236 


338057 


Dunham, 


. etal. 


Plus 


8526397-8526522 


338410 


Dunham, 


. et.al 


Plus 


19292807-19292916 


338588 


Dunham, 


. etal 


Plus 


22896767-22896920 


338665 


Dunham, 


.et.al 


Plus 


24472654-24472853 


338832 


Dunham, 


.et.al. 


Plus 


27775128-27775290 


338980 


Dunham, 


.etal. 


Plus 


29896789-29896874 


339352 


Dunham, 


. et.al. 


Plus 


33544784-33545121 
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332929 Dunham, I. etal. 

332930 Dunham, I. etal. 
3329a3 Dunham, I. etal. 
333009 Dunham, f. etal. 

5 333010 Dunham, I. etal, 
333013 Dunham, I. etai. 
333108 Dunham, I. etal 
333343 Dunham, I. etal 
333456 Durham, I. etal. 
10 333459 Dunham, I. etal. 
333743 Dunham, I. etai. 
333758 Dunham, I. etal. 

333904 Dunham, 1. etal. 

333905 Dunham, I. etal. 
15 334222 Dunham, I. etal. 

334223 Dunham, i. etal. 

334360 Dunham, I. etal. 

334784 Dunham, I. etal. 

334789 Dunham, I. etal. 
20 335004 Dunham, I. etal. 

335115 Dunham, I. etal. 

335342 Dunham, i. etal. 

335544 Dunham, I. etal. 

335610 Dunham, I. etal. 
25 335682 Dunham, I. etal. 

335755 Dunham, I. etal. 

335782 Dunham, I. etal. 

335791 Dunham, I. etal. 

335895 Dunham, I. etal. 
30 335917 Dunham, I. etal. 

335920 Dunham, I. etal. 

336042 Dunham, I. etal. 

336150 Dunham, L etal. 

336152 Dunham, I. etal. 
35 336416 Dunham, I. etal. 

336449 Dunham, I. etal. 

336471 Dunham, I. etal. 

336512 Dunham, I. etal. 

336558 Dunham, I. etal. 
40 335560 Dunham, I. etal. 

336676 Dunham, I, etal. 

337968 Dunham, I. etal. 

338451 Dunham, I. etal. 

338689 Dunham, I. etal. 
45 339373 Dunham, I. eU 

325622 5867000 

329655 6448516 

329899 6563505 

329960 5091594 

5 0 326213 5867224 

326474 5867405 
330084 6015302 

326816 6552458 

326817 6552458 
55 327110 6117842 

327196 5867446 
327283 5867478 
327313 5867501 
327450 5867766 
60 328059 6117819 
328492 5868455 
328304 6004478 
328857 6381927 
329367 5868842 

6 5 329373 6682537 



Minus 2020758-2020664 

Minus 2022565-2022497 

Minus 2631933-2631797 

Minus 2766043-2765856 

Minus 2766207-2766119 

Minus 2772278-2772039 

Minus 3240494-3240389 

Minus 46928864692753 

Mmus 2631933-2631797 

Minus 5144548-5144344 

Minus 75732187573060 

Minus 7666413-7666091 

Minus 8217374-6217261 

Minus 8217796-6217670 

Minus 12732417-12732289 

Minus 12734365-12734269 

Minus 13728850-13728751 

Minus 16294548-16294360 

Minus 16306095-16305996 

Minus 20581911-20581794 

Minus 21388250-21388146 

Minus 22597448-22597284 

Minus 24650505-24650403 

Minus 25068943-25068841 

Minus 25421215-25421093 

Minus 25763806-25763747 

Minus 25908578-25908440 

Minus 25948563-25948411 

Minus 26975307-26975239 

Minus 27028481-27028377 

Minus 27034927-27034811 

Minus 29041694-29041500 

Minus 30150423-30150256 

Minus 30156053-30155870 

Minus 34047408-34047311 

Minus 34204707-34204577 

Minus 34215091-34214978 

Minus 34278373-34278275 

Minus 34375825-34375698 

Minus 34376814-34376596 

Minus 2022565-2022497 

Minus 7095797-7095680 

Minus 20174286-20174193 

Minus 24893073-24892972 

Minus 33860127-33860047 

Plus 69994-70075 

Minus 35565-35843 

Minus 111058-111783 

Minus 1031-1162 

Minus 60751-60927 
Plus 16995-18101 

Minus 57019-59337 
Plus 198354-198436 
Plus 199909-200001 
Plus 94608-94785 
Plus 180921-181333 
Minus 567-962 
Minus 89734-89838 
Minus 4792848076 
Plus 37052^7204 
Minus 46094-46241 
Minus 3884-3952 
Minus 80557-81051 
Minus 87201-87587 
Minus 38950-39301 
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TABLE 14: Table 2 from BRCA 001-5 US 



Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
5 cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn; Exemplar Accession number, Genbank accession number 

1 0 UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 



15 Pkey ExAccn UniGenelD Unigene Title R1 

100038 M97935 AFFX control; STAT1 16.7 

100114 D00596 Hs.82962 thymidy I ate synthetase 15.9 

100975 J02923 Hs,76506 lymphocyte cytosolic protein 1 (L-plaslin) 30.1 

20 101031 J05070 Hs. 151738 matrix metallopfoteinase 9 (gelatinase B; 9 37.2 

101104 L07615 Hs.169266 Human neuropeptide Y receptor Y1 (NPYY 18.3 

101143 L12723 Hs.90O93 heat shock 70kD protein 4 17.4 

101332 L47276 Hs.156346 Homo sapiens (cell line HL-6) alpha topois 18.9 

101378 M13755 Hs.833 interferon-stimulated protein; 15 kDa 18.1 

25 101809 M86849 Hs 323733 Homo sapiens connexin 26 (GJB2) mRNA 22.5 

102618 U65932 Hs 81071 extracellular matrix protein 1 23.2 

102721 U79241 Hs 118666 Human clone 23759 mRNA; partial cds 15 

102817 U90904 Hs 83724 Human clone 23773 mRNA sequence 15.2 

102907 X06985 Hs 202833 heme oxygenase (decycling) 1 22.7 

30 102985 X17644 Hs 2707 G1 to S phase transition 1 20.6 

103060 X57766 Hs 155324 matrix metalloproteinase 11 (stromelysin 3 17.8 

103180 X69433 Hs5337 isocitrate dehydrogenase 2 (NADP+); mrto 18.9 

103206 X72755 Hs 77367 monokine induced by gamma interferon 15.1 

103821 AA157623 Hs 198793 K I AA07 50 gene product 23.3 

35 104115 AA428090 Hs 26102 ESTs 28.7 

104667 AA007234 Hs. 30098 ESTs 16.6 

105186 AA191512 Hs 28005 Homo sapiens mRNA; cON A DKFZp564G 19.3 

106103 AA421104 Hs 12094 ESTs 15.4 

107151 AA621169 Hs.8587 ESTs 19 

40 109415 AA227219 Hs 110826 trinucleotide repeat containing 9 20.1 

110189 H20543 Hs.6278 DKFZP586B 1621 protein 16.6 

110561 H59617 Hs5199 ESTs; Weakly similar to UBIGUITIN-CON 19.5 

110734 H98714 Hs 24131 ESTs 30.2 

110915 N46252 Hs 29724 ESTs 23.2 

45 111179 N67239 Hs 10760 ESTs 37 

111357 N91023 Hs 87128 ESTs 15 

112134 R46025 Hs7413 ESTs 17.4 

113970 W86748 Hs8109 ESTs 15 

114124 Z38595 Hs 125019 ESTs, Highly similar to K1AA0886 protein 22 

50 114292 240715 Hs 184641 delta-6 fatty acid desaturase 19.4 

114901 AA236276 Hs 196437 ESTs. Weakly similar to R26660J; partial 16.9 

114965 AA250737 Hs 72472 ESTs 35.1 

115652 AA405098 Hs.38178 ESTs 16.1 

115875 AA433943 Hs 43946 ESTs, Weakly similar to Weak similarity t 33.5 

55 116790 H29532 Hs 101 174 mic rotubule -associated p rote in tau 22.2 

116921 H72948 Hs821 biglycan 20.7 

117412 N26722 Hs42645 ESTs 18.1 

120241 Z41815 Hs 65946 ESTs 15.6 

120325 AA195651 Hs 104106 ESTs 15.2 

60 121596 AA416740 Hs 174104 ESTs 22.6 

123619 AA609200 ESTs 23.1 

124006 D60302 Hs 270016 ESTs 20.6 

125852 H09290 Hs 76550 Homo sapiens mRNA; cDNA DKFZp564B 25.9 

126160 N90960 Hs.265398 ESTs; Weakly similar to transformation-re I 16.4 
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127677 AA916752 Hs.264190 
128595 U31875 Hs.152677 
128717 T30617 Hs, 104222 
129124 AA234530 Hs 108802 
5 129366 H18027 Hs 184697 
130455 X17059 Hs 155956 
130604 X03635 Hs 1657 
130913 W03592 Hs 21198 
130944 M97935 Hs 21486 
10 131472 AA608962 Hs 27258 
131562 U90551 Hs 28777 
132180 AA405569 Hs418 
132406 F09979 Hs4774 
132465 AA047896 Hs 49169 
15 132994 AA505133 Hs279905 
133294 R79723 Hs 69997 
133634 U24166 Hs.234279 
134374 D62633 Hs.8236 
134405 J04177 Hs 82772 
20 134470 X54942 Hs.83758 
134495 D63477 Hs.84087 
134714 U89922 Hs.890 
135237 AA454930 Hs.9691 
301884 AA312082 Hs.105445 
25 302276 NMJX)4448Hs.323910 
302290 AL1 17607 Hs.175563 
309177 AI951118 
309583 AW170O35 
310438 AW022192 Hs.200197 
30 311166 AI821294 Hs.118599 
312153 AA759250 Hs.153028 
313915 AI969390 Hs.163443 
314506 AA833655 Hs.206868 
314558 AI873274 Hs.190721 
3 5 314691 AW207206 Hs.136319 
314943 AI476797 Hs.184572 
315196 AA972756 Hs.44898 
316177 AI908272 Hs.293102 
318073 AW167087 Hs.131562 
40 318662 AI285898 Hs.294014 
318740 NM_002543Hs.77729 
318744 AI793124 Hs.144479 
319668 NM_002731Hs.87773 
320074 AA321166 Hs.278233 
45 320211 AL039402 Hs.125783 
320727 U96044 Hs.181125 
322818 AW043782 Hs.293616 
322882 AW248508 Hs.279727 
324261 AL044891 Hs.269350 
50 324432 AA464510 Hs.152812 
324603 AW016378 Hs.292934 
324620 AA448021 Hs.94109 
324988 T06997 Hs. 121 028 
330388 X03363 
55 330486 M13755 Hs.633 

330814 AA015730 Hs.265398 
331145 R72427 Hs.129873 
331306 AA252079 Hs.63931 
331890 AA432166 Hs.3577 
60 332526 AA281753 HsJ7515 
332532 N63192 Hs.1892 
332694 AA262768 Hs.243901 
332958 
333769 
65 333968 
334223 
334264 



ESTs; Highly similar to MEM3 [M.muscul 
short-chain alcohol dehydrogenase family m 
Homo sapiens mRNA; cDNA DKFZp566L 
N-ethyimaleinrude-sensitive factor 
plexin C1 

N-acetyttransferase 1 (arylamine N-acetytt 
estrogen receptor 1 

translocase of outer mitochondrial membra 

signal transducer and activator of transcript 

calcycfin binding protein 

H2A histone family; member L 

fibroblast activation protein; alpha; seprase 

ESTs 

ESTs 

solute earner family 2 (facilitated gkjeose t 
zinc finger protein 238 
microtubule-associated protein; RP/EB fam 
ESTs 

collagen; type XI; alpha 1 
CDC28 protein kinase 2 
KIAA0143 protein 

tymphotoxin beta (TNF superfamily; memb 
ESTs 

GDNF family receptor alpha 1 

EST cluster (not in UniGene) with exon hit 

Homo sapiens mRNA; cDNA DKFZp564N 

EST singleton (not in UniGene) with exon 

EST 

ESTs 

ESTs 

cytochrome b-561 

ESTs 

ESTs 

ESTs 

ESTs 

cell division cycle 2; G1 to S and G2 to M 
ESTs 

EST duster (not in UniGene) 

ESTs 

ESTs 

EST cluster (not in UniGene) 
ESTs 

EST duster (not in UniGene) 

EST cluster (not in UniGene) 

DEME-6 protein 

EST cluster (not in UniGene) 

ESTs 

DiGeorge syndrome critical region gene 2 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
ESTs 

EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
HER2 receptor tyrosine kinase (c-erbB-2; E 
interferorvstimulated protein; 15 kDa 
ESTs; Weakly similar to transform ation-rel 
ESTs; Weakly similar to CYTOCHROME 
dachshund (Drosophila) homolog 
succinate dehydrogenase complex; subun'rt 
inositol 1;4;5-triphosphate receptor; type 3 
EST; Highly simQar to PHEN YLETHANO 
KIAA1067 protein 
CH22_FGENES.46_15 
CH22_FGENE$.271_8 
CH22_FGENES.307_4 
CH22_FGENES.360_4 
CH22_FGENES367_15 
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17.3 

27.1 

24.5 

207 

18.2 

26.4 

39.9 

20,9 

18.8 

18.1 

18.8 

15.4 

15 

15.4 

26.4 

30.4 

15.2 

15.2 

15.3 

20.3 

16.1 

35.7 

19.5 

20.7 

21.6 

41.4 

243 

64.5 

39.1 

24.1 

27.1 

27.1 

27.8 

225 

214 

18.4 

28.8 

32.6 

15.7 

16.3 

213 

35 

25.4 

16.7 

24.3 

15.3 

21 

15.3 

50.1 

16.7 

23.1 

212 

24.5 

17.7 

67 

44.1 

41.9 

15,1 

243 

19 

153 

15.2 

17.8 

483 

15.9 

33.5 

18.5 
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335791 CH22_FGENES.611_7 27.3 

336512 CH22_FGENES.834_7 21.4 

338008 CH22_EM:AC005500.GENSCAN. 1 27-9 1 5.2 
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TABLE 14A 



Table 14A shows the accession numbers for those pkeys lacking unigenelD's for Table 14. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustenng 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey CAT number Accession 

309583 1046029_-2 AW170035 

20 335512 CH22_3941FG_834J.LINK.DJ 

338008 CH22_6490FG_LINK_EMACOO 

333769 CH22_1036FGJ71_8_LINK_EM 

333968 CH22J245FG_307J_LINHLEM 

335791 CH22_3160FG_611J_LINK_EM 

25 309177 AI951118 

332958 CH22_182FG_48_15_LINK_EM: 
334223 CH22J 507FG_360_4JJNK_EM 

334264 CH22J551FG_367J5_UNK_E 

123619 371681_1 AA6029 64 AA 609200 



10 



Pkey: 

CAT number: 
Accession: 



Unique Eos probeset identifier number 
Gene duster number 
Genbank accession numbers 



15 
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TABLE 14B 



10 



15 



Table 14B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham !. et al* refers to the publication 

entitled 'The DNA sequence of human chromosome 22" Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand Nt jposition 



20 



25 



332958 
333769 
333968 
334264 
338008 
334223 
335791 
336512 



Dunham, I. et.al. 
Dunham, i. et.al. 
Dunham, I. et.al. 
Dunham, i. etal. 
Dunham, !. et.al. 
Dunham, I. et.al. 
Dunham, I. et.al. 
Dunham, i. et.al. 



Plus 

Plus 

Pius 

Plus 

Plus 

Minus 

Minus 

Minus 



2516164-2516310 

7696625-7696707 

8681004^681241 

13234447-13234544 

76970687697236 

12734365-12734269 

25948563-25948411 

34278373-34278275 
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TABLE 15: Table 3 from BRCA 001-5 US 
Table 15 shows genes downregulated in breast cancer cells. 



10 



Pkey; Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjeneiD: Unigene number 

UnigeneTitle: Unigene gene title 

R1 : Ratio of normal breast tissue to tumor 



Pkey ExAccn UniGene ID Unigene Title 



„ R1 

15 

100115 D00632 Hs.1 721 53 glutathione peroxidase 3 {plasma} 1.7 

100499 TIGR:HT1428 Hs.283108 Globin, Beta 1.5 

100502 T1GR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3 

100815 TIGR:HT4268 Hs.9739 L-Grycerol-3-Phosphate;Nad+Oxidoreduct 1.7 

20 101125 L10373 Hs.82749 transmembrane 4 superfamily member 2 1.5 

101367 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha po 2.9 

101397 M15856 Hs.180878 lipoprotein lipase 1-6 

101883 N68399 Hs.75613 CD36 antigen (collagen type I receptor; thr 1.6 

102227 U25138 Hs.93841 potassium large conductance calcium-activ 1.6 

25 102857 X00129 Hs.76461 retind-binding protein 4; interstitial 3 

103211 X73079 Hs.288579 polymeric immunoglobulin receptor 1.8 

103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5 

103562 Z21966 Hs.2815 POU domain; class 6; transcription factor 1 1.8 

104672 AA007629 glycerol-3-phosphate dehydrogenase 1 (sol 2.4 

30 105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7 

105138 AA164519 Hs.15248 ESTs 1-5 

106075 AA417915 Hs,25930 ESTs 1.5 

106870 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6 

107099 AA609645 Hs.211568 eukaryotic translation initiation factor 4 gam 2.7 

35 107616 AA004901 Hs.261164 ESTs 1.6 

107997 AA037388 Hs.82223 Human DNA sequence from clone 14 1H5 o 1.7 

108604 AA099820 Hs.49696 ESTs 2.4 

111130 N64265 Hs.19515 yz44M 2.s1 Morton Fetal Cochlea Homo sa 17 

111837 R36447 Hs.24453 ESTs 16 

40 112538 R70255 ESTs 1 9 

112808 R97970 Hs.281022 EST 15 

113086 T40652 Hs.209100 DKFZP434C171 protein 19 

115740 AA418033 Hs. 283559 ESTs 16 

115949 AA443800 Hs.43125 ESTs 2 

45 115965 AA446661 Hs.173233 ESTs 2.2 

117224 N20300 Hs.218707 ESTs 17 

117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 17 

119059 R15436 Hs.77889 Friedreich ataxia region gene X 123 17 

119175 R71792 Hs.301002 ESTs; Weakly simtiar to cell death activate 2.8 

50 119359 T71021 . Hs.285681 ESTs; Highly similar to WS basic-hellx-loo 1.9 

119798 W73386 Hs.249129 ESTs 3 

120889 AA365784 Hs.97044 ESTs 16 

121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-family pro 1.8 

121750 AA421184 Hs.97549 ESTs 15 

55 122127 AA434447 Hs.106771 ESTs 2.5 

122348 AA443695 Hs.293410 ESTs 2.1 

122485 AA448300 Hs.1 60318 phospholemman 1-5 

123443 AA598841 Hs.167382 natriuretic peptide receptor A/guanylate cy 1.8 
123505 AA600135 ESTs; Moderately similar to I!!! ALU SUB 1.5 

60 125284 W94688 Hs.103253 perilipin 17 
126300 D81972 HUM427D08B Human fetal brain (TFujiw 1.8 

126747 R72515 Hs.160318 phospholemman 1-6 

127218 AA309765 Hs.1 16017 ESTs; Weakly similar to K1AA0795 protei 1.5 

127357 AA452788 Hs.75432 zx39g11.r1 Scares_total_fetu$_Nb2HF8_9 1.7 
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127638 AA634405 Hs. 122608 ESTs 1.5 

128213 AA972780 Hs. 129 194 ESTs; Weakly similar to !!! I ALU SUBFA 1.5 

128351 AI092391 Hs.134886 ESTs 1.5 

128842 N44757 Hs 20340 ESTs 1.6 

5 128870 R71403 Hs 75309 eukaryotic translation elongation factor 2 1.7 

129146 AA459944 Hs 108924 DKFZP586P1422 protein 1.5 

129285 T62068 Hs 11006 ESTs 2 1 

129331 N93465 Hs.279772 ESTs; Highly similar to CGI -38 protein [H 15 

130085 M62402 Hs 274313 insulin-like growth factor binding prote n 6 17 

10 130400 M25079 Hs.2 83 108 hemoglobin; beta 17 

131267 AA21 1776 Hs2504 myomesin 1 (skelemin) (185kD) 38 

131277 AA131466 Hs.23767 ESTs 19 

131282 M12272 Hs 4 alcohol dehydrogenase 3 (class I); gamma p 2 2 

131304 AA295848 Hs.25475 aquaporin7 17 

15 131810 D49487 Hs 194236 leptin (murine obesity homolog) 25 

132788 AA045503 Hs 56874 ESTs; Weakly similar to Homo sapiens p2 16 

132931 Z41452 Hs.6090 deleted in bladder cancer chromosome regi 15 

133120 X64559 Hs 65424 tetranectin (plasminogen-binding protein) 2 

133314 U95367 Hs 70725 gamma-aminobutyric acid (GABA) A recep 15 

20 133507 X74295 Hs 74369 integrin; alpha 7 17 

133601 S95936 Hs 284176 transferrin 2 3 

133702 N56898 Hs 75652 glutathione S-transfe rase M5 19 

134111 N79674 Hs8022 TU3A protein 4 6 

134699 U56814 Hs 88646 deoxyribonuc lease l-like 3 15 

25 134749 L10955 Hs 89485 carbonic a nhyd rase IV 16 

135173 M72885 Hs.95910 Human G0S2 protein gene; complete cds 19 

300132 AW027556 Hs 156286 ESTs 17 

300732 AI369956 Hs 257891 ESTs 15 

300750 AA514805 Hs 293055 ESTs 18 

3 0 301140 AI807692 Hs 129129 ESTs 16 

301396 AA923549 Hs 224121 ESTs 2 1 

302910 N77976 Hs 251577 hemoglobin; alpha 1 18 

303798 V00505 Hs 36977 hemoglobin; delta 16 

303831 T04868 Hs 46780 EST cluster (not in UniGene) with exon hit 17 

35 303844 U94362 Hs 58589 glycogenin 2 15 

304182 H91086 EST singleton (not in UniGene) with exon 15 

304622 AA516384 EST singleton (not in UniGene) with exon 1 5 

304682 AA550994 EST singleton (not in UniGene) with exon 1 7 

305612 AA782347 Hs 272572 EST singleton (not in UniGene) with exon 15 

40 306193 AA923457 EST singleton (not in UniGene) with exon 15 

307206 Al 192534 EST singleton (not in UniGene) with exon 1 6 

307377 AI222691 EST singleton (not in UniGene) with exon 1 5 

308023 AI452732 Hs 251577 EST singleton (not in UniGene) with exon 19 

308359 AI612774 Hs 79372 retinoid X receptor; beta 15 

45 309838 AW296073 Hs 255504 EST 1.5 

310403 AI720978 Hs.148006 ESTs; Moderately similar to alternatively s 1 8 

311671 AW241947 Hs 232478 ESTs 16 

311794 AW238092 Hs 254759 ESTs 2 1 

312082 T79860 Hs 118180 ESTs 19 

5 0 312575 H25237 Hs 306814 ESTs 2 3 

313076 N49684 Hs 143040 ESTs 18 

313283 W32480 Hs .157099 ESTs 2 2 

313374 AW328672 Hs 132760 ESTs 19 

314701 AI754634 Hs 131987 ESTs 17 

5 5 315391 AA759098 Hs 192007 ESTs 18 

315688 AA680055 Hs 264885 ESTs 15 

316249 AA948612 Hs 130414 ESTs 16 

316586 AI205077 Hs 294085 ESTs 1 7 

316890 AA837079 Hs 24647 ESTs 15 

60 316983 AI480204 Hs 177131 ESTs 15 

317604 AI650625 Hs 300756 ESTs 16 

317951 AW2O6520 Hs 129621 ESTs 15 

319400 W26902 Hs 154085 ESTs 1 7 

320757 H22654 Hs6382 EST duster (not in UniGene) 15 

65 321594 AA021402 Hs 11067 ESTs 17 

322102 H45589 EST cluster (not in UniGene) 15 

322814 AI824495 Hs 21 1038 ESTs 2 2 
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322929 A13655S5 Hs.146246 ESTs 2.3 

323831 AA335715 Hs.200299 ESTs 1.7 

324044 AL045752 Hs.22350 ESTs 1.8 

324675 AW014734 Hs.157969 ESTs 22 

5 325272 CH.11_hsgi)5866902 1-5 

325558 CH.12_hs gi[6056302 1 .6 

325656 CH.14_hsgi|6055305 1 6 

326120 CH.17_hsgi|5867l94 15 

326139 CH.17_hsgi|5867203 1 5 

10 326855 CH.20_hs gi[6552460 1 5 

327438 CH.02 hs gi[6004454 1.6 

329733 CH.14_p2gi|6065783 1 6 

3X931 F01443 Hs.284256 ESTs 4.6 

331591 N71677 Hs.42146 ESTs 19 

15 332159 AA621393 Hs.112984 EST 1.5 

332364 W94688 Hs.103253 periSpin 2.1 

332502 H21819 Hs.14896 Homo sapiens done 24590 mRNA sequenc 1.5 

334175 CH2JLFGENES.349J0 1.5 

334347 CH22_FGENES.375_31 1.8 

20 334737 CH22„FGENES.424_12 1.8 

335352 CH22_FGENES.539_5 1.5 

335639 CH22„FGENES.584_19 1.6 

336244 CH22_FGENES.746_2 1.5 

336336 CH22_FGENES.814_8 1.7 

25 336865 CH22_FGENE$.305-1 1.6 

337494 CH22_FGENES.799-12 1.6 

337764 CH22_EM:AC000097.GENSCAN.119-1 1.8 

337983 CH22 EMAC005500.GENSCAN.110-1 2 

338192 CH22_EM;AC005500.GENSCAN.228-1 1.5 

30 339366 CH22_BA354H2.GENSCAN.34-2 1.5 
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TABLE ISA 



Table 15A shows the accession numbers for those pkeys lacking unigenelD's for Table 15. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

126300 250375J D81972 BEO03132 

20 112538 504579J M908813 R70255 

123505 genbank_AA600135 AA600135 - 

104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 A1421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892AA910870AW473816 H25721 AW451438 F19647F22375 
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386 

2 5 322102 46708J H45589 H 19807 AF0750 38 H 19808 H42 437 

336865 CH22_4590FG_305_1_ 

338192 CH22_6755FG„LINKEM:AC00 

329733 c14_p2 

326120 c17_hs 

3 0 326139 c17_hs 

326855 c20_hs 

335352 CH22_2699FG_539_5_UNK_EM 

335639 CH22_2999FG_584_19_UNKLE 

307206 AI192534 

35 307377 AI222691 

337494 CH22_5727FG_799_12_ 

337764 CH22_6115FG_LINK_EM:AC00 

337983 CH22_6438FG„LINK_EM:AC00 

339366 CH22_8336FG_LINK_BA35411 

40 325272 c11Jis 

325558 c12_hs 

325656 c14 hs 

334175 CH22_1455FG_349_10JJNK_E 

304182 H91086 

45 334347 CH22_1640FG_375_31_LINK_E 

327438 c_2_hs 

304622 AA516384 

334737 CH22_2049FG_424_12_LINK_E 

304682 AA550994 

5 0 336244 CH22„3642FG_746_2_LIN^DA 

306193 AA923457 

336336 CH22_3746FG_814_8_LINK_BA 
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TABLE 15B 

Table 15B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey; Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al.* refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham t. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: I ndicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Nt jwsition 


334347 


Dunham, I. etal. 


Plus 


13663814-13663926 


334737 


Dunham, I etal. 


Plus 


15998517-15998685 


335639 


Dunham, 1. etal. 


Plus 


25173591-25173696 


337494 


Dunham, 1 . etal. 


Plus 


33339024-33339148 


334175 


Dunham, I. etal. 


Minus 


11668659-11668597 


335352 


Dunham, 1. etal. 


Minus 


22681512-22681384 


336244 


Dunham, 1. etal. 


Minus 


31402729-31402583 


336336 


Dunham, 1. etal. 


Minus 


33797209-33797076 


336865 


Dunham, 1. etal. 


Minus 


8622405-8622289 


337764 


Dunham, 1. etal. 


Minus 


40356404035446 


337963 


Dunham, t. etal. 


Minus 


7275495-7275271 


338192 


Dunham, 1. etal. 


Minus 


13248453-13248277 


339366 


Dunham, 1. etal. 


Minus 


33647431-33647293 


325272 


5866902 


Minus 


13247-13312 


325558 


6056302 


Plus 


70930-71030 


325656 


6056305 


Minus 


78190-78707 


329733 


6065783 


Plus 


163237-153450 


326120 


5867194 


Plus 


36116-36276 


326139 


5867203 


Minus 


218901-218960 


326855 


6552460 


Minus 


111390-111463 


327438 


6004454 


Minus 


199569-199692 
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TABLE 16: Table 4 from BRCA 001-5 US 



Table 16, a subset of table 15, depicts a preferred group of genes highly downregulated in 
breast cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene TrUe: Unigene gene title 

R1: Ratio of normal breast tissue to tumor 



Pkey 


ExAccn 


UniGene [D 


Unigene Title 


R1 


100502 


TIGR:HT1496 


Hs 169228 


Adrenal-Specific Protein Pg2 


2.3 


101367 


M12963 


Hs4 


alcohol dehydrogenase 1 (class I); alpha 


2.9 


102857 


X00129 


Hs.76461 


retinol-binding protein 4; interstitial 


3 


104672 


AA007629 




gfycerol-3-phosphate dehydrogenase 1 


2.4 


107099 


AA609645 


Hs 211568 


eukaryotic translation initiation factor 4 gam 


2.7 


108604 


AA099820 


Hs.49696 


ESTs 


2.4 


115949 


AA443800 


Hs 43125 


ESTs 


2 


115965 


AA446661 


Hs 173233 


ESTs 


2.2 


119175 


R71792 


Hs 301002 


ESTs; Weakly similar to cell death activator 


2.8 


119798 


W73386 


Hs 249129 


ESTs 


3 


122127 


AA434447 


Hs 106771 


ESTs 


2.5 


122348 


AA443695 


Hs. 293410 


ESTs 


2.1 


129285 


T62068 


Hs 11006 


ESTs 


2.1 


131267 


AA211776 


Hs.2504 


myonresin 1 (skelemin) (185kD) 


3.8 


131282 


M12272 


Hs.4 


alcohol dehydrogenase 3 (class t); gamma 


2.2 


131810 


D49487 


Hs 194236 


leptin (murine obesity homolog) 


2.5 


133120 


X64559 


Hs.65424 


tetranectin (plasminogen-binding protein) 


2 


133601 


S95936 


Hs.284176 


transferrin 


2.3 


134111 


N79674 


Hs 8022 


TU3A protein 


4.6 


301396 


AA923549 


Hs.224121 


ESTs 


2.1 


311794 


AW238092 


Hs.254759 


ESTs 


2.1 


312575 


H25237 


Hs.306814 


ESTs 


2,3 


313283 


W32480 


Hs. 157099 


ESTs 


2.2 


322814 


A1824495 


Hs.211038 


ESTs 


2.2 


322929 


AI365585 


Hs 146246 


ESTs 


2.3 


324675 


AW014734 


Hs. 157969 


ESTs 


2.2 


330931 


F01443 


Hs.284256 


ESTs 


4.6 


332364 


W94688 


Hs, 103253 


perilipin 


2.1 


337983 
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TABLE 16A 

Table 16A shows the accession numbers for those pkeys lacking unigenelD's for Table 16. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

104672 6735 7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI30O352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 
AI276281 R48205AI245302 AM90036 AI231050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 AA774528 M007629 H42537 C01077 F32386 
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TABLE 17: Table 1 from BRCA 014 P 



5 Table 17 shows accession numbers representing 759 sequences of breast cancer genes or 
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene 
identified, a ratio of the relative amount of expression in breast tumors versus normal breast 
tissue. 



10 



15 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenefD; Unigene number 

Unlgene Title: Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenefD Unigene Title R1 

20 100227 AV654694 Hs.82316 interferon-induced, hepatitis C-associat 3 

100405 AW291587 Hs.82733 nidogen2 3.2 

100406 AI962060 Hs.1 18397 AE-binding protein 1 3.6 
100420 D86983 Hs 118893 Melanoma associated gene 3.2 
100911 X83300 Hs 289103 SMA4 5.2 

25 100960 J00124 Hs 117729 keratin 14 (epidermolysis bullosa simple 4.3 

101011 BE387036 Hs 1211 acid phosphatase 5, tartrate resistant 3 

101183 AA442324 Hs795 H2A histone family, member O 3.2 

101194 L20971 Hs 188 phosphodiesterase 4B, cAMP-specific (dun 3 

101329 U66042 Hs 82171 Homo sapiens clone 19 1B7 placenta exp res 4.1 

30 101378 BE563085 Hs 833 interferon-stimulated protein, 15 kDa 53 

101474 R07566 Hs 73817 small Inducible cytokine A3 (homologous 3 9 

101491 M25809 Hs 64173 ATPase, H+ transporting, lysosomal (vacu 4 5 

101530 M29874 Hs 1360 cytochrome P450, subfamily I IB (phenobar 9 

101602 AA353776 Hs 901 CD48 antigen (B-cell membrane protein) 3 4 

35 101663 NM_003528 Hs2178 H2B histone family, member Q 5.6 

101758 BE019494 Hs 79217 pyrrol in e-5-carboxy late reductase 1 3 6 

101767 M81057 Hs 180884 carboxy peptidase B1 (tissue) 12 

101817 M89907 Hs 1 52292 SWI/SNF related, matrix associated, acti 3.2 

101851 BE260964 Hs 82045 midkine (neurite growth-promoting factor 4.1 

40 101878 M97815 Hs 183650 cellular retlnoic acid-binding protein 2 6.5 

102209 NM„002038 Hs 265827 Interferon, alpha -inducible protein (do 3 

102214 U23752 Hs32964 SRY (sex determining region Y)-box 11 3 

102297 MM_00 1504 Hs 198252 G protein-coupled receptor 9 3.7 

102299 NM_005824 Hs 155545 37 kDa leucine-rich repeat (LRR) protein 3.7 

45 102301 NMJW5651 Hs 183671 tryptophan 2 ,3-dioxygenase 5.2 

102305 AL043202 Hs 90073 chromosome segregation 1 (yeast homolog) 3 5 

102369 U39840 Hs 299867 hepatocyte nuclear factor 3, alpha 3.9 

102591 U62325 Hs 324125 amyloid beta (A4) precursor protein-bind 4 

102721 H16646 Hs 1 1 8666 hypothetical protein PP591 3 5 

50 102739 AA363025 Hs 155572 Human clone 23801 mRNA sequence 3.2 

102791 AF080229 gb:Human endogenous retrovirus K clone 1 3 

102804 NMJ)02318 Hs 83354 lysyf oxidase-like 2 3.2 

102903 M73779 Hs 250505 retinoic acid receptor, alpha 3.3 

103010 X52509 Hs 161640 tyrosine aminotransferase 12.4 

55 103042 T81656 Hs.252259 ribosomal protein S3 4.5 

103117 X63578 Hs 295449 parvalbumin 3 

103207 X72790 gb:Human endogenous retrovirus mRNA for 5 9 

103282 BE390551 Hs 77628 steroidogenic acute regulatory protein r 3 9 

103284 AJ751601 Hs.8375 TNF receptor-associated factor 4 3 3 

60 103329 X85134 Hs 72984 retinoblasto ma-binding protein 5 31 

103364 X90872 Hs 279929 gp25L2 protein 3 

103385 NM_007069 Hs 37189 similar to rat H RE V1 07 3.4 

103456 AA496425 Hs.9629 papillary renal cell carcinoma (transloc 3 2 
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103498 Y09306 
103558 BE616547 
103563 L02911 
103612 BE336654 
103825 AI571835 
104073 AW779318 
104103 AW021102 
104115 AF183810 
104168 AA461618 
104173 AA084273 
104181 AF173296 
104189 AB040927 
104269 AI559444 
104307 AI929700 
104518 H20816 
104556 AV650851 
104658 AA360954 
104748 AAO 15879 
104755 T49951 
104825 AA035613 
104830 AW294092 
104865 T79340 
104906 BE298684 
104961 H78517 
105038 AW503733 
105088 H58589 

105092 AA1 48982 

105093 AL137566 
105304 AW134924 
105397 AA814807 
105409 AW505076 
105431 AA252033 
105552 AA256750 
105598 AA279439 
105650 W16741 
105688 AI299139 

105808 AI133161 

105809 AW973653 
105909 AA195191 
105965 AA131657 
106135 AL117474 
106184 W28948 
106293 N39842 
106400 BE397649 
106474 BE383668 
106484 AA351978 
106533 AL134708 
106614 AA648459 
106636 AW958037 
106661 AW499914 
106743 BE613328 
106844 AA485055 

106864 AI311928 

106865 AW192535 
106871 AW472981 
106942 AA995351 
106968 AF216751 
107105 AW963419 
107158 N32849 
107248 AW263124 
107265 BE379594 
107630 AW961576 
107710 AI955040 
107890 AA025386 
107985 T40064 
108000 AI263307 
108217 AA058686 



Hs.30148 homeodomaii-lnteracting protein kinase 3 3.4 

Hs.2785 keratin 17 3.7 

Hs.150402 Acfivin A receptor, type I (ACVR1) (ALK 3.2 

Hs.70937 H3 histone family, member A 4,5 

Hs.55468 ESTs 4 

Hs.88417 ESTs 3 8 

Hs.21509 ESTs 4 3 

Hs.26102 opposite strand to tricborhinophalangeal 7 6 

Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi 3 6 

Hs.76561 ESTs, Weakly similar to S47072 finger pr 4 

Hs.283740 DC6 protein 3 

Hs.301804 K1M1494 prote'n 3.2 

Hs.293960 ESTs 4,3 

Hs.111680 endosutfine alpha 3.1 
Hs.1 12423 Homo sapiens mRNA; cDNADKFZp586H420 (f 3.2 

Hs.96900 hypothetical protein; K1AA1830 protein 4.4 

Hs.27268 Homo sapiens cDNA: FLJ21933 fis, done H 3.2 

Hs.33536 ESTs 3.2 

Hs.9029 DKFZP434 G032 protein 4.5 

Hs.141883 ESTs 6.9 

Hs.21594 hypothetical protein MGC 15754 11.1 

Hs.22575 B-cell CLL/tymphoma 6, member B (zinc fi 3.5 

Hs, 26802 protein kinase domains containing protei 6.5 

Hs.33905 ESTs 3.6 

Hs.9414 K1AA1488 protein 4.5 

Hs,35156 Homo sapiens cDNA FLJ 11027 fis, clone PL 3 8 

Hs.29068 ESTs 3 
Hs 32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 4 8 

Hs 190325 ESTs 8.2 

Hs 7395 hypothetical protein FLJ23182 3 1 

Hs 301855 DiGeorge syndrome critical region gene 8 4 2 

Hs.242413 hypothetical protein DKFZp434K1421 4 4 

Hs28802 centaurin-alpha 2 protein 3.2 

Hs 279763 hypothetical protein FU 10504 3 5 

Hs 25635 HSPC003 protein 3 7 

Hs 17517 ESTs 5 5 

Hs.286131 CGI-101 protein 3 5 

Hs 20104 hypothetical protein FU00052 3 3 

Hs 51 1 1 hypothetical protein FLJ20729 3 2 

Hs 23830 ESTs 3 3 
Hs.41181 Homo sapiens mRNA; cDNADKFZp727C191 (fr 3 2 

Hs 10762 ESTs 3.3 

Hs 301444 KIAA1673 4.1 
Hs.94109 Homo sapiens cDMA FU13634 fis, clone PL 31 
Hs.42484 hypothetical protein FLJ 1061 8 3 2 

Hs 4943 hepatocellular carcinoma associated prot 7 8 

Hs 145998 ESTs 3 
Hs.335951 hypothetical protein AF301222 38 
Hs,286 ribosomat protein L4 3 3 

Hs.7579 hypothetical protein FLJ 10402 3 
Hs.21938 hypothetical protein FLJ12492 4 2 

Hs.1 58213 sperm associated antigen 6 34 
gb:c}o89h04.x1 NCLCGAP„Kid5 Homo sapiens 4 4 
Hs.1 9479 ESTs 3 6 

Hs.321130 hypothetical protein MGC277 1 4 1 

Hs.31314 retinoblastoma-birxting protein 7 3.6 
Hs.26813 CDA14 53 
Hs.1 55223 stanniocaldn 2 3.4 
Hs„31844 hypothetical protein FLJ 12586 3.1 
Hs.315111 nuclear receptor co-rep ressor/HDAC3comp 5.9 
Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.9 
Hs.60178 ESTs 4.6 
Hs.265398 ESTs, Weakly similar to transformation r 3 
Hs.61311 ESTs, Weakly simftar to S10590 cysteine 3.1 
Hs.71968 Homo sapiens mRNA; cDNA DKFZp564F053 (fr 4.8 
Hs.239884 H2B histone family, member L 3.3 
Hs.62588 ESTs 3.8 
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108435 


T82427 


Hs.194101 


Homo sapiens cDMA: FLJ20869 fts, clone A 


3 


108591 


AB033073 


Hs.43857 


similar to glucosam'me-6-sulfatases 


3.3 


108733 


AA121022 




gb:zri84f10.r1 Stratagene lung carcinoma 


3.9 


108771 


AF068290 


Hs.79741 


hypothetical protein FLJ10116 


6.1 


108819 


AA011449 


Hs.271627 


ESTs 


3.6 


108912 


AA1 36674 


Hs.1 18681 


EST 


3.9 


109086 


AF186114 


Hs.270737 


tumor necrosis factor (ligand) superfami 


3.7 


109124 


AK000684 


Hs. 183887 


hypothetical protein FLJ22104 


3.1 


109132 


AI970536 


Hs. 16603 


hypothetical protein FL1 13163 


3.7 


109163 


N23235 


Hs.30567 


ESTs, Weakly similar to B34087 hypotheti 


4.5 


109277 


AA1 96443 


Hs.86043 


Homo sapiens cDNA FLJ 13558 fis, clone PL 


3.7 


109410 


AW504732 


Hs.21275 


hypothetical protein FLJ 1 101 1 


4.6 


109454 


AA232255 


Hs.295232 


ESTs, Moderately similar to A46010 X-lin 


6.4 


109514 


A A 2340 87 


Hs.262346 


ESTs, Weakly similar to S72482 hypotheti 


4.8 


10QS81 


R45584 


Hs 23025 


ESTs, Weakly similar to ALU5_HUMAN ALU S 


3.3 


109632 


AA325138 


Hs.235873 


hypothetical protein FLJ 22672 


3 


109544 


AWQ7?Qfi4 


Uc: 901531 


ESTs, Highly similar to 1203217A dehydro 


3 




FOQfinci 




gb:HSC33H092 normalized infant brain cDN 


3.2 


109768 


FOfifttft 

ruDooo 


Hs 14763 

MS. ItI 


ESTs 


3.2 


1flGftfi7 
I \nSQ\Jl 


rvtOQMU 


He 19499 


ESTs 


3.8 


109842 


MVVO IOHOO 


Hs 9^00 


enhrtp r_Arripr familv 1fi fmnnnr^irhnxvlir. 


3.3 


I WOiJO 


AKfYllfiRfl 

ArsUU IDOU 


Hs VVlftft 


nKF7P4'UF091 nrotein 


3.6 


1 10024 


n»?3f J lot 


Hs 11050 


ESTs 


4.2 


1 10561 


A AT7QSQ7 

MHO/ <7JJ ' 


Hs <i1QQ 


ucppicTi rimtpin similar to uhlrniitin-nnn 


5.1 


i i unfa 




Ms 9AQ1SQ 
nS ZH3 103 




5.3 


1 10707 


AlZOtX) OZ 


Ms; 1*ifi17 
nb. too 1 1 


FSTs Weaklv similar to ALU4 HUrVlAN ALU S 


3.7 


1 mens 

I I US I 0 


RFflQ99fl*\ 

Dcuyzzoo 


nb.Za/ Z*t 


hvnnthpttr^l nrntpin Fl J 1^187 


37 


i i i toy 


NWDOO 


Ms 9QOQ41 
nii.zuui?*to 


ESTs 


4 


1 111W 

I I I 1 00 


IwOOUJ 


Ms 191358 


ESTs 


31 


I I I 1 3a 


A t7R7i^ I i 


Mc 9QA99 


ESTs 


4 5 


1 11336 


MrtO/ JOO 


Hs ?Qfiq4 


ESTs 


54 


1 1 mm 


r\Uf OOD 


Ms IfLTiS 

1(3. IOOJO 


ESTs 


32 


111*^9 
i i iooz 


ROftMfl 




nrvvf1Qf09 s1 Soares fetal liver SDleen 


31 


I I \ 003 


AAfift9flfi4 
MMOUZW+ 


Ms 919fif) 
no ZOZOV 


ESTs 


3 2 


1 11fi9^ 


KjOZOZ 


Mc 94Q44 


F^iTc W Afl klu similar tn 91 0Q9fifiA B f¥> II 


33 


•j 1107c 


noozoy 


Mc 9Q^94fi 
nb ZjOZhd 


F^lTc Waaltlu cimilar tn nillattvP nl^O [H 


31 


1 1 1892 


AA491HP1 


Ms 191ftft 


ESTs 


34 


1 1 

1 1 1030 


APn70S9fi 


na fOMZj 


Mnmn canipns rlnnp 947A7 mRrslA spniipnnp 


3 3 


1 1Z IZO 


Mvvo/yuzy 


Ms 11AT*A 
nb 1 IOOOO 


F^Ts Wpaklv similar tn unnampH nmtpin 

CO 1 fVCdlMV Dill NICIJ U( tl luL 1 pi U Lwil I 


4 4 


1 19170 


DcZ*+0f *+0 


Ms 9flftS9Q 

no Z0OJZ3 


hvnnthptirjil nrntpin FU296T5 

nypuu Jcuvxii piwiciii r utiujj 


73 


1 12287 


rvDUO JU04 


Ms ?14flflfi 
no jj*tuv/u 


KIAA19?ft nrntpin 


32 


1 19'W"1 


W9j1T^! 

nz^oo** 


Mc 96195 
nb.ZO I ZO 


ESTs 


4 4 


1 i9*vn 

I IZOvO 


RI\j17Q7 




nh'vnft7h07 s1 Snarps infant hrain 1 N IB H 


34 


1 1 947ft 


RCC(Tfi7 
rSOOUOf 


Ms 9AAR4 
no. zoo oh 


ESTs 


82 


1 19*ifi1 


AI7Q1dCrt 


Mc 19QR71 
nb. izyo/ o 


F^Ts Wpaklv similar tn A 16036 n/t orhrom 


55 


1 izDJl 


KozWU 




nh-uififihOfi c1 ^namc rJar^inta Mh9MP Hnmn 


3 9 




Kozool 


nb Jjj 


ESTs 


54 


1 19^57 
I IZOJ/ 


AWAiljlfl7ft 
MVVO*H0f 0 


Mc 1Q7RQ 
nb I 3i 0? 


hurvifript^al nrntpin Mfi(^.il174 


32 




AM1 fl/lfifi 


Mc nPiR5 
nb . ooou j 


ESTs 


47 


112917 


AA082465 


Hs. 125031 


c holine/etha no lam ine phosphotra nsfe ra se 


37 


113070 


AB032977 


Hs6298 


KIAA1151 protein 


31 


113095 


AA828380 


Hs 126733 


ESTs 


34 


113117 


AW8 13731 


Hs 159153 


ESTs, Moderately similar to S65657 alpha 


34 


113187 


BE613410 


Hs31575 


SEC63, endoplasmic reticulum translocon 


3.2 


113200 


T57773 


Hs 10263 


ESTs 


35 


113206 


BE262470 


Hs.241471 


RNB6 


62 


113374 


T79925 


Hs.269165 


ESTs, Weakly similar to ALU1 HUMAN ALU S 


37 


113440 


U54727 


Hs 191445 


ESTs 


3 


113494 


T91451 


Hs 86538 


ESTs 


34 


113518 


AW367738 


Hs 323954 


postmeiolic segregation increased 2-like 


31 


113571 


AI702609 


Hs 15713 


hypothetical protein MGC2776 


31 


113822 


MM 004585 


Hs 17466 


reUnoic acid receptor res ponder (tazaro 


3 9 


113835 


AI912410 


Hs.27475 


Homo sapiens cONA FLJ 12749 lis, clone NT 


3 


113938 


W81598 




gb:zd88g02.s1 Soaies_fetaLhearLNbHH19W 


4.6 


113947 


W84768 




gb:zh53d03.s1 Soares Jetaljiver_spteen_ 


3.1 


113970 


W27249 


Hs.8109 


hypothetical protein FLJ21080 


6.9 


114086 


AA378776 


Hs,288649 


hypothetical protein MGC3077 


4.3 
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114148 
114424 
114518 
114563 
114965 
114995 
115121 
115134 
115167 
115253 
115277 
115327 
115354 
115657 
115676 
115709 
115729 
115787 
115830 
115835 
115850 
115900 
115935 
115948 
116092 
116115 
116184 
116192 
116208 
116246 
116443 
116470 
116726 
116845 
117026 
117216 
117296 
117403 
117691 
118229 
118363 
118416 
118470 
118502 
118695 
118925 
119025 
119036 
119063 
119075 
119620 
119741 
119747 
119754 
119905 
120084 
120241 
120326 
120742 
120870 
120885 
120970 
121054 
121095 
121103 
121121 
121337 



AW470411 
AW780192 
AW1 63267 
AI979168 
A1733881 
AA769266 
AI634549 
AW968073 
AA749209 
BE149S45 
AA81410O 
N46436 
AA281636 
AA405620 
AA953006 
AW293849 
AA417812 
AH 26772 
AW970529 
AA521410 
NM 014937 
AK001500 
AA354549 
AL042465 
AB041035 
AL042355 
AW450737 
AA464976 
A1219083 
AF265555 
AW962196 
AI272141 
AK001114 
AA649530 
H88256 
AI569804 
AL133427 
H84455 
AB040959 
AW968941 
Al 183838 
N66028 
AW970584 
AL1 57488 
AK000465 
N92293 
BE003760 
R95872 
R 16833 
M10905 
W47620 
AF041853 
AI970797 
AL037824 
AW449064 
W94472 
AA825686 
AA1963C0 
AA225084 
AA357172 
AA365515 
AA398118 
AW976570 
AA320134 
AA398936 
AA399371 
AW885727 



Hs.238433 
Hs.267595 
Hs.106469 
Hs.82226 
Hs.72472 
Hs.193657 
Hs.88155 
Hs.194331 
Hs.43728 
Hs.289038 
Hs.86693 
Hs. 109221 
Hs.334827 
Hs.55158 
Hs.88143 
Hs,58279 
Hs.38775 
Hs.40479 
Hs.86434 
Hs.41371 
Hs.52463 
Hs.165186 
Hs.41181 
Hs.43445 
Hs.93847 
Hs.70202 
Hs.128791 
Hs.62528 
Hs.42532 
Hs.250646 
Hs.321264 
Hs.83484 
Hs.53913 

Hs.50456 
Hs.42792 
Hs.42506 
Hs.40639 
Hs.93836 
Hs.166254 
Hs.48938 
Hs.49105 
Hs.291033 
Hs.50150 
Hs.50081 
Hs.206832 
Hs.55209 
Hs.1 17572 
Hs.53106 
Hs.287820 
Hs.56009 
Hs.43670 
Hs.64859 
Hs.194695 
Hs.1 19571 
Hs.59529 
Hs.321176 
Hs.21145 



neurotrim'tn 
ESTs 

suppressor of varl (S.cerevisiae) 3-like 
glycoprotein (transmembrane) nmb 
BMP-R1B 
ESTs 
ESTs 

ESTs, Highly similar to A55713 inositol 
hypothetical protein 
hypothetical protein MGC4126 
ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to T29520 hypotheti 
ESTs 

ESTs, Weakly similar to ALU7_HUMAN ALU S 
ESTs 
ESTs 

hypothetical protein FLJ21816 
ESTs 

KIAA0966 protein 
hypothetical protein FU 13852 
Homo sapiens mRNA; cDNA DKFZp727C191 (fr 
pofy(A)-specific ribonuclease (deadenyla 
NM_016931:Homo sapiens NADPH oxidase 4 ( 
WD repeat domain 10 
CGI-09 protein 

ESTs, Moderately similar to A46010 X-tin 
ESTs, Moderately similar to ALU8_HUMAN A 
baculoviral IAP repeat-containing 6 
LBP protein 32 

SRY (sex determining region Y)-box 4 
hypothetical protein FLJ10252 
gb:ns44f05.s1 NCl_CGAP_AM Homo sapiens 
ESTs, Moderately similar to ZN75_HUMAN Z 
ESTs, Weakly similar to I78885 serine/th 
Homo sapiens mRNA full length insert cON 
ESTs 

DKFZP434N014 protein 
hypothetical protein DKFZp566l133 
hypothetical protein FLJ21802 
FKBP-associated protein 
ESTs 

Homo sapiens mRNA; cONA DKFZp564B182 (fr 
K1AA1199 protein 

ESTs, Moderately similar to ALU8_HUMAN A 
Homo sapiens mRNA; cDNA DKFZp434K05l4 {f 
chemokine binding protein 2 
ESTs, Moderately similar to ALU1„HUMAN A 
fibronectin 1 

Z-S'-oKgoadenylate synthetase 3 (100 k 
kinesin farniiy member 3A 
ESTs 

ras homotog gene family, member I 
collagen, type 111, alpha 1 (Ehlers-Oanl 
ESTs, Moderately similar to ALU1_HUMAN A 
ESTs, Weakly similar to S65824 reverse t 
hypothetical protein RG083M05.2 
gb:nc21d06.r1 NCI_CGAP_Pr1 Homo sapiens 
Hs.292581 ESTs, Moderately similar to ALU1_HUMAN A 
Hs.301872 hypothetical protein MGC4840 
Hs.97579 ESTs, Weakly similar to A46010 X-Rnked 
Hs.97387 ESTs 

Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 
Hs.97697 EST 

Hs.189095 similar to SALL1 (sal (DrosoprrJa)-hke 
Hs.301570 ESTs 
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3.1 

4.8 

10.1 

3.6 

3.2 

4.2 

3 

3.6 
3.9 
3.4 
4.8 
3.5 
9,3 
3.4 
4 

3.1 
3.6 
3.1 
3 

3.2 
3 

3.1 
6.7 
3.6 
3.1 
3.3 
3.2 
3.6 
4.1 
4.1 
8.6 
3.2 
3.5 
3.1 
3.2 
4.7 
3 

3.3 

4.3 

3.1 

3.4 

5.2 

3.4 

3,3 

19.7 

3.7 

4.1 

3.2 

3.3 

3.1 



3.1 
8.4 

3.6 
3.2 
3.6 
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5.3 
4 

3.5 
6.3 
4.7 
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121351 AW206227 Hs.287727 hypothetical protein FU23 132 5 

121611 M31669 Hs.1735 inhibin, beta B (activin AB beta poly pep 3.6 

121643 AA640987 Hs.193767 ESTs 5 6 

121770 NMJJ15902 Hs 278428 progestin induced prote 'in 3 4 

5 122125 AK000492 Hs 98806 hypothetical protein 4 1 

122338 AA443311 Hs 98998 ESTs 3 

122417 AA446965 Hs 112092 ESTs 4 7 

122513 AI767879 Hs.99214 ESTs 3 8 

122544 AW973253 Hs 292689 ESTs 3 

10 122655 AA323296 Hs 97837 Homo sapiens mRNA; cDNA DKFZp547J047 (fr 5 6 

122805 AA526911 Hs 82772 collagen, type XI, alpha 1 3 2 

122851 AW205931 Hs 99598 hypothetical protein MGC533 8 8 6 

123105 AA487809 Hs 166011 catenin (cadherin-associated protein), d 3 

123111 AA228776 Hs 191721 ESTs 6 9 

15 123249 AA371307 Hs 125056 ESTs . 3 6 

123273 AA491253 Hs.173611 EmpiricaJly selected from AFFX single pr 7 

123385 BE149685 Hs 17767 KIM 1554 protein 3.1 

123419 T66087 Hs 112482 Homo sapiens unknown mRIMA sequence 3 4 

123485 AI308875 Hs 103849 hypothetical protein DKF2p761D112 31 

20 123645 A1675944 Hs 188691 Homo sapiens cDNA FU 12033 fis, clone HE 3.8 

123819 AA580082 Hs 112264 ESTs 4 7 

124012 AA352723 Hs 241471 RNB6 3 8 

124243 H69125 Hs 133525 ESTs 4.1 

124357 N22401 gb:yw37g07.s1 Morton Fetal Cochlea Homo 4 1 

25 124359 N22508 Hs 139315 Homo sapiens cDNA: FU21479 fis, clone C 3 6 

124567 AW451645 Hs 151504 Homo sapiens cDNA FU 11973 fis, done HE 31 

124911 N34151 Hs. 174195 interferon induced transmembrane protein 3.5 

124972 R41396 Hs 101774 hypothetical protein FU23045 4 3 

125006 BE065136 Hs 145696 splicing factor (CC1.3) 6 

30 125042 T78906 Hs 269432 ESTs, Moderately similar to ALU 1JHUMAN A 8.1 

125184 W60326 Hs 288584 Homo sapiens cDNA FU 11750 fis, clone HE 4 7 

125243 AW970536 Hs 105413 ESTs 3 1 

125286 AF086534 Hs.187561 ESTs, Moderately similar to ALU 1_HUMAN A 3 3 

125304 AL359573 Hs.124940 GTP-binding proteii 3 

35 125330 AW880562 Hs 114574 ESTs 3 

125331 AI422996 Hs. 161378 ESTs 3 2 

125685 AI924630 Hs4943 hepatocellular carcinoma associated prot 3 2 

126257 N99638 gb:za39g11.r1 Scares fetal liver spleen 4 

126474 AW975814 Hs326714 Homo sapiens clone IMAGE713177, mRNA se 4 

40 126666 AA648886 Hs 151999 ESTs 3 8 

126872 AW450979 gb:Ul-H-BM*a-12-04Jl.s1 NCI_CGAP_Su 3 

127431 AW771958 Hs.175437 ESTs, Moderately similar to PC4259 tern 3 6 

127980 AA961459 Hs.125644 ESTs 41 

127997 AW068311 Hs 311054 Homo sapiens mRNA full length insert cDN 3 3 

45 128420 AA650274 Hs41296 fib ronectin leucine rich transmembrane p 4 6 

128609 NMJJ03616 Hs 102456 survival of motor neuron protein interac 3 9 

128946 Y13153 Hs 107318 kynurenine 3-monooxygenase (kynurenine 3 31 

128955 AA775076 Hs 185807 Homo sapiens, Similar to PRO0478 protein 3 9 

129092 D56365 Hs 63525 pory(rC) -binding protein 2 3 3 

50 129270 AA357185 Hs 109918 ras homolog gene famify, member H 31 

129301 AF182277 Hs.330780 cytochrome P450, subfamity MB (phenobar 3 9 

129385 AA172106 Hs 110950 Rag C protein 6 2 

129619 AA209534 Hs 284243 tetraspan NET-6 protein 3 4 

129629 AK000398 Hs,11747 hypothetical protein FU20391 3 

55 129725 X56411 Hs.1219 alcohol dehydrogenase 4 (class II), pi p 3 2 

130069 AI754813 Hs 146428 collagen, type V. alpha 1 5 4 

130092 X03363 Hs 323910 v^erb-b2 avian erythroblastic leukemia v 4 4 

130298 AI347487 Hs 132781 class I cytokine receptor 4.6 

130382 NMJ03450 Hs 155204 zinc finger protein 174 5.6 

60 130622 AI582291 Hs 16846 ESTs, Weakly similar to 04HUD1 debrisoqu 3 

130703 R77776 Hs 18103 ESTs 3.8 

130881 AA809875 Hs 25933 ESTs 4.2 

130954 AB014544 Hs 21572 KIAA0644 gene product 4.7 

131095 AI399653 Hs 22917 ESTs 4.3 

65 131153 H09048 Hs 23606 ESTs 3.8 

131253 R71802 Hs24853 ESTs 3.5 

131372 AW293399 Hs,144904 nuclear receptor co-repressor 1 3.6 
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131507 AI826268 Hs.27769 ESTs, Weakly similar to MCATJHUMAN MFTOC 

131587 AI695549 Hs.183868 glucuronidase, beta 

131739 AF017986 Hs.31386 secreted frizzJed-related protein 2 

131795 BE501849 Hs.32317 high- mobility group 2 OB 

5 131970 D86960 Hs.3610 KIAAO 2 05 gene product 

131986 NMJJ02314 Hs.36566 LIM domain kinase 1 

132093 AA400091 Hs.39421 ESTs 

132122 AA426202 Hs.40403 Cbp/p300-interacting transactivator, wit 

132159 D76435 Hs.41154 Zic family member 1 (odd-paired Drosophi 

10 132333 AA192669 Hs.45032 ESTs 

132406 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 

132482 AV660345 Hs.238126 CGI-49 protein 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 

132624 AA326108 Hs 33829 bHLH protein DEC2 

15 132700 AA319233 . Hs.5521 ESTs 

132725 NMJJ06276 Hs.184167 splicing factor, arginine/serine-rich 7 

132790 W73311 Hs.169407 SAC2 (suppressor of actin mutations 2, y 

132847 T46195 Hs.58189 eukaryotic translation initiation factor 

132857 Y00272 Hs.184572 ceil division cycle 2, G1 to S and G2 to 

20 132936 AL120659 Hs.6111 a ryl-hydro carbon receptor nuclear transl 

133130 AI128606 Hs.6557 zinc finger protein 161 

133142 AW952412 Hs.65874 ESTs, Weakly similar to A40348 Etav/Sex- 

133167 AW162840 Hs.6641 kinesin family member 5 C 

133225 AW600291 Hs.6823 hypothetical protein FL J 104 30 

25 133274 AA085191 Hs.6949 hypothetical protein MGC1 1275 

133275 293241 Hs.239934 CGl-96 protein 

133287 AW797437 Hs.69771 B -factor, properdin 

133376 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 

133462 AW675064 Hs.73875 fu mary la cetoacetate hydrolase (fumarylac 

30 133740 AW162919 Hs.170160 RAB2, member RAS oncogene family-like 

133831 BE274552 Hs 76578 protein inhibitor of activated STAT3 

133976 AJ908165 Hs. 169946 GATA-binding protein 3 (T-celi receptor 

134666 BE391929 Hs,8752 transmembrane protein 4 

134710 AI433797 Hs.8889 serine hydroxy methyltransferase 1 (solub 

35 134731 D89377 Hs,89404 msh (Drosophila) homeo box homolog 2 

134776 J05582 Hs.89603 mucin 1 , transmembrane 

135230 AF064804 Hs ,96757 suppressor of Ty (S.cerevisiae) 3 homoto 

135303 R61253 Hs.98265 KIAA1 877 protein 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 

40 135411 L10333 Hs.99947 reticulon 1 

300089 AI199738 Hs.208275 ESTs, Weakly similar to ALUA_HUMAN !!!! 

300233 AW614220 Hs.189402 ESTs 

300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 
300256 AW591433 Hs.298241 Transmembrane protease, serine 3 

45 300378 Z45270 Hs.235873 hypothetical protein FLJ22672 

300973 AA572949 Hs.207566 ESTs 
301111 R10799 Hs.191990 ESTs 
301341 AA887801 Hs.208229 G protein-coupled receptor 
301548 AI091631 Hs.203845 two pore potassium channel KT3.3 

50 301884 AA312082 Hs.105445 GDNF family receptor alpha 1 

301936 U79745 Hs.1 14924 solute carrier tamily 16 {monocarboxyltc 
301975 T97905 gb:ye54d0.r1 Soares fetal liver spleen 

302001 AB020711 Hs.278346 K1AA0904 protein 
302067 BE542706 Hs.222399 CEGP1 protein 

55 302094 AW749321 Hs.6786 ESTs 

302099 AL049670 Hs.137576 ribosomal protein L34 pseudogene 1 
302145 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 
302235 AL049987 Hs.166361 Homo sapiens mRN A; cONA DKFZp564F1 12 (fr 
3C2276 AW057736 Ms 323910 HER2 receptor tyrosine kinase (c-erb-b2, 

60 302290 AA179949 Hs.175563 Homo sapiens mRMA; cDNA DKFZp564N0763 (( 
302372 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 
302378 AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 

302384 AI678059 Hs.202676 syn a ptonemai complex protein 2 

302385 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 
65 302680 AW192334 Hs.38218 ESTs 

302330 A1038997 Hs.132921 ESTs 

302857 AF282265 Hs.44836 inner centromere protein antigens (135kD 
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302892 AW176909 Hs.42346 calcine u^'n-binding protein calsarcin-1 3.4 

302970 W05508 Hs.312679 ESTs, Weakly similar to A490 19 dynein he 5.1 

303271 AA652687 Hs.96151 Human DNA sequence from clone RP5-1103G7 3.7 

303289 AL121460 Hs.272673 hypothetical protein FU20 508 4.1 

5 303357 AW006352 Hs.159643 ESTs, Weakly similar to T32554 hypotheti 4.2 

303540 AA355607 Hs.309490 ESTs, Weakly similar to putative WHSC1 p 4.3 

303563 AA367699 Hs.10082 potassium intermediate/small conductance 3 3 

303642 AW299459 gb:xs50d08.x1 NCI_CGAP_Kid1 1 Homosapien 4 2 

303780 AI424014 Hs.18995 KIAA1 304 protein 3 6 

10 303797 AW629759 gb:hh70e05.y1 MCI_CGAP_GU1 Homo sapiens 4 9 

303852 R53434 Hs.90207 hypothetical protein MGC1 11 38 3 7 

304328 AA149951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h08.s1 NCI_CGAP_Gas1 Homo sapiens 41 

305913 AA876109 gb nx24h01,s1 NCLCGAP.GC3 Homo sapiens 3 

15 305917 AA876469 gb:oe48b04.s1 NCI_CGAP_Pr25 Homo sapiens 3.1 

307010 AI140014 gb qa68f09.x1 Soares_fey_hearLNbHH19W 3 5 

307041 AI144243 gb:qb85b12.x1 SoaresJetal_heart_NbHH19W 3 9 

308106 AI476803 gbt|77e12.x1 Soares_NSF_F8_9W_OT_PA_P_S 4 3 

308307 AI581398 Hs.172928 collagen, type I, alpha 1 4 6 

20 308615 AK000142 Hs,101774 hypothetical protein FU2304 5 4.4 

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3 

309328 AW024348 Hs 233191 EST, Weakly similar to A2721 7 glucose tr 3 2 

309574 AW168083 gb.xg59g04.x1 NC1__CGAP_Ut4 Homo sapiens 31 

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY BR 57.6 

25 310064 AI199712 Hs.148486 ESTs, Weakly similar to 1917210A Pro/Arg 4 6 

310098 AI685841 Hs.161354 ESTs 36 

310438 AW022192 Hs 200197 ESTs 4 6 

310683 AI939456 Hs. 160870 ESTs 3.2 

310727 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 3 6 

30 310781 AI380797 Hs.158992 ESTs 102 

310895 AI955121 Hs 165724 N-acety1galactosam'ine4-0-sutfotransfera 34 

310955 AI476732 Hs.263912 ESTs 105 

311117 AI671439 Hs.196029 Homo sapiens mRNA for KJAA 1657 protein, 31 

311166 AI821005 Hs.1 18599 ESTs 10.8 

35 311237 AA641098 Hs.208809 ESTs, Moderately similar to ALU 1 .HUMAN A 4 3 

311465 AI758660 Hs 206132 ESTs 4.4 

311587 AI828254 Hs.271019 ESTs, Weakly similar to A47582 B-ceilgr 5 1 

311598 AW023595 Hs.232048 ESTs 5 8 

311774 AA700870 Hs.14304 ESTs 33 

40 311785 AI056769 Hs.133512 ESTs 3 9 

311872 R12375 Hs.194600 ESTs 3 3 

311889 AA767342 Hs. 122483 ESTs, Weakly similar to PSF„HU MAN PTB^S 3 

311913 AI358522 Hs270188 ESTs 3 

311923 T60843 Hs 189679 ESTs 5 6 

45 311935 AA216387 gb:nc16b02.s1 NCIjCGAP_Pr1 Homo sapiens 5 2 

312019 AA373630 Hs.188750 ESTs 3 

312021 AA759263 Hs.14041 ESTs 34 

312067 T78968 Hs.14411 ESTs 3 5 

312090 T80177 Hs 118064 similar to rat nuclear ubiquitous casein 38 

5 0 312147 AI633744 Hs.195648 ESTs, Weakly similar to I38022 hypotheti 4 4 

312153 BE261944 Hs.118625 hexo kinase 1 52 

312168 T92251 Hs 198882 ESTs 3 3 

312182 T94344 Hs.326263 ESTs 3 3 

312187 AA700439 Hs.188490 ESTs 34 

55 312199 AW438602 Hs 191179 ESTs 39 

312219 H73505 Hs.1 17874 ESTs 4 

312226 AA315703 Hs.199993 ESTs, Weakly similar to ALUBJWMAN t!!l 4 9 

312299 AA972712 Hs.269737 ESTs 5 7 

312544 AA516420 Hs 183526 ESTs, Weakly similar to 138022 hypotheti 6 3 

60 312638 AW439195 Hs256880 ESTs, Weakly simflar to S65657 alpha- 1 C- 4 9 

312826 AW291545 Hs 185018 ESTs 49 

312837 AW292286 Hs 255058 ESTs 4 4 

312980 AA497043 Hs. 11 5685 ESTs 3 1 

313070 Af422023 Hs. 161338 ESTs 4.3 

65 313079 N76497 Hs.1787 proteolipid protein 1 (Pefizaeus-Merzbac 3.3 

313089 AF026944 Hs.293797 ESTs 5.8 

313096 AW073310 Hs.163533 Homo sapiens cDNA FLJ14142 fis, clone MA 4.5 
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313126 
313166 
313197 
313280 
313325 
313328 
313352 
313385 
313393 
313417 
313434 
313569 
313591 
313615 
313915 
313975 
313979 
313997 
314043 
314078 
314097 
314121 
314129 
314138 
314236 
314244 
314305 
314306 
314322 
314394 
314401 
314465 
314506 
314510 
314546 
314547 
314558 
314627 
314648 
314691 
314729 
314754 
314814 
314864 
314881 
314882 
314981 
315006 
315021 
315051 
315060 
315073 
315080 
315175 
315183 
315193 
315196 
315198 
315240 
315263 
315282 
315296 
315368 
315397 
315489 
315498 
315526 



AA746503 
AI801098 
AW979008 
AW960454 
AI420611 
AW449211 
AW150945 
AI032087 
AI674685 
AA741151 
W92070 
AI273419 
AA046309 
AI540978 
C18863 
AW175896 
AI535895 
AV657317 
AA827082 
AW1 29357 
AA648744 
AI732083 
AA228366 
AA740616 
AA743396 
AL036450 
AI280112 
A1697901 
AA907153 
AW961597 
AI660412 
AA602917 
AA833655 
AI204418 
AW007211 
AA399272 
AI873274 
AA425310 
AW979268 
AW207206 
AA457367 
AW026761 
BE350122 
AW971198 
AI095087 
AA828032 
AW972359 
AI538613 
AA533447 
AW292425 
AA551104 
AW452948 
AA744550 
AI025842 
AW136134 
AI241331 
AI367347 
AI741506 
R38772 
AW510994 
AI222165 
AA876905 
AB037745 
AA218940 
AI378817 
AA628539 
AI193043 



Hs.283313 ESTs 
Hs.151500 ESTs 
Hs.222487 ESTs 
Hs.222830 ESTs 
Hs.127832 ESTs 

Hs. 105445 GDNF family receptor alpha 1 
Hs.144758 ESTs 
Hs.269819 ESTs 
Hs.200141 ESTs 
Hs.137323 ESTs 

gb:zh48g05.r1 Saares_feta!Jiver_spleen_ 
Hs.135146 hypothetical protein FU 13984 

gb:zf12f01.s1 Soares_fetalJieait_NbHH19W 
Hs.301997 hypothetical protein FU 13033 
Hs. 163443 Homo sapiens cDNA FU 11576 fis, clone HE 
Hs.65114 keratin 18 
Hs.221024 ESTs 

Hs.288649 hypothetical protein MGC3077 

Hs.291872 ESTs 

Hs.329700 ESTs 

Hs.269493 ESTs 

Hs.187619 ESTs 

Hs.115122 ESTs 

gb:ny97f11.s1 NCl_CGAP_GCB1 Homo sapiens 
Hs. 189023 ESTs 
Hs.103238 ESTs 

Hs.125232 Homo sapiens cDNA RJ13266 fis, clone OV 
Hs.192425 ESTs 
Hs.190060 ESTs 

Hs.130816 ESTs, Moderately similar to 138022 hypot 
Hs.234557 ESTs 
Hs.156974 ESTs 

Hs.206868 Homo sapiens cDKA FU14056 fe, done HE 
Hs. 190080 ESTs 

Hs.16131 hypothetical protein FU1 2876 
Hs.144341 ESTs 
Hs.190721 ESTs 

Hs. 155 766 ESTs, Weakfy simitar to A47582 B-cellgr 

gb:EST391378 MAGE resequences, MAGP Homo 
Hs.136319 ESTs 
Hs.191638 ESTs 
Hs.134374 ESTs 

Hs.1 57367 ESTs, Weakly similar to I78885 serine/th 
Hs.294068 ESTs 

Hs.1 52299 ESTs, Moderately simitar to S65657 alpha 
Hs.189076 ESTs 
Hs.293334 ESTs 

Hs.298241 Transmembrane protease, serine 3 
HS.312989 ESTs 
Hs.163484 ESTs 

Hs.1 89048 ESTs, Moderately similar to ALUC_HUMAN I 
Hs.257631 ESTs 
Hs.136345 ESTs 
Hs.152530 ESTs 
Hs.220277 ESTs 

Hs.131765 ESTs, Moderately similar to 138937 DNA/R 

Hs.44898 Homo sapiens done TCCCTA00151 mRNA sequ 

Hs.165900 ESTs, Weakly similar to ALU1_HUMAM ALU S 

Hs.172619 myelin transcription factor 1-like 

Hs.220740 ESTs 

Hs.144923 ESTs 

Hs.125286 ESTs 

Hs.104896 KIAA1324 protein 

Hs.137516 fidgeMe 1 

Hs.191847 ESTs 

Hs.1 16252 ESTs, Moderately similar to ALU1_HUMAN A 
Hs.128685 ESTs, Weakly similar to T17226 hypotheti 
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315530 AW015415 Hs.127780 ESTs 8 9 

315562 AA737415 Hs.152826 ESTs 5 5 

315634 AA837085 Hs.220585 ESTs 6 3 

315647 AA648983 Hs 212911 ESTs 3 6 

5 315707 AI418055 Hs 161160 ESTs 5 1 

315772 AW515373 Hs 271249 Homo sapiens cDNA FL J 13580 fis, clone PL 3 1 

315850 AW270550 Hs 116957 ESTs 3 8 

315858 AA737345 Hs 294041 ESTs 5 

315878 AA683336 Hs 189046 ESTs 31 

10 315977 AW865916 Hs 151206 ESTs 4 7 

315978 AA830893 Hs 119769 ESTs 41 

315995 AI217477 Hs 194591 ESTs 4 1 

316012 AA764950 Hs 119898 ESTs 7 

316042 AI469960 Hs.170698 ESTs 4 9 

15 316052 AI962796 Hs 136754 ESTs 4 1 

316072 AW517524 Hs 135201 NOD 2 protein 3 2 

316074 AW975114 Hs 293273 ESTs 3 8 

316100 AW203986 Hs 213003 ESTs 3 2 

316133 AI187742 Hs 125562 ESTs 3 7 

20 316177 AI904982 Hs 293102 ESTs, Moderately similar to ALU 1_HU VAN A 30.7 

316186 AI433540 gb:ti69g05.x1 NCLCGAP„Kkl11 Homo sapten 3 1 

316244 AI640761 Hs 224988 ESTs 3 5 

316303 AA740994 Hs 209609 ESTs 3 8 

316313 AA741300 Hs 202599 ESTs, Weakly similar to I38022 hypotheti 44 

25 316364 AA747807 Hs 149500 ESTs 3 2 

316580 AA938198 Hs 146123 poly(A) polymerase gamma 9 4 

316697 AW293174 Hs 252627 ESTs 4 4 

316715 AI440266 Hs 170673 ESTs, Weakly similar to T24832 hypotheti 3 

316868 AI660898 Hs 195602 ESTs 3 2 

30 316869 AI954880 Hs 134604 ESTs 3 2 

316886 AA836331 Hs. 134981 ESTs 4 4 

316897 AA838114 Hs 221612 ESTs 3 7 

316943 AW014875 Hs 137007 ESTs 4 6 

317069 AI732892 Hs. 190489 ESTs 5 9 

35 317194 AW445167 Hs 126036 ESTs 41 

317360 AI125252 Hs 126419 ESTs 3 5 

317404 AI806867 Hs 126594 ESTs 51 

317452 AA972965 Hs 135568 ESTs 6 9 

317501 AI822034 Hs 137097 ESTs 4 6 

40 317674 AW294909 Hs 132208 ESTs 4 3 

317803 AW664964 Hs 128899 ESTs 61 

317834 X56348 Hs 287270 ret protooncogene (multipte endocrine n 3 1 

317850 AI681545 Hs.152982 hypothetical protein FLJ 131 17 3 4 

317881 AI827248 Hs 224398 Homo sapiens cDNA FLJ1 1469 fis, clone HE 9 6 

45 317902 AW102941 Hs 211265 ESTs 4 1 

317916 AI565071 Hs159983 ESTs 10.3 

318042 AW294522 Hs 149991 ESTs 31 

318223 AI077540 Hs 134090 ESTs 3 9 

318327 AW294013 Hs 200942 ESTs 3 

50 318332 AIO93930 Hs.163440 Homo sapiens cDNA: FLJ21000 fis, done C 4 4 

318418 AF107493 Hs 118498 Homo sapiens LUCA-15 protein mRNA, splic 5 4 

318558 AW402677 Hs 146381 RN A binding motif protein, X chromosome 4.4 

318625 AA526235 Hs 193162 Homo sapiens cDNA FL J 11 9B3 fis, cloneHE 59 

318634 T49598 Hs 156832 ESTs 4 

55 318740 NMJX)2543 Hs 77729 oxidised low density lipoprotein (lectin 7 3 

318744 AI793124 Hs 144479 ESTs 17.8 

318781 F11802 Hs6B18 ESTs 3 

319191 NMJ)12391 Hs 79414 prostate epithelium-specific Ets transcr 3 6 

319478 AI524124 Hs 270307 ESTs 4 6 

60 319510 W88532 Hs 254562 ESTs 3 3 

319551 AA761668 gb:nz24c08.s1 NCI_CGAP_GCB1 Homo sapiens 3 2 

319745 T79366 Hs. 108258 actio binding protein; mac rophin (microf 3 3 

319834 AA071267 gb:zm61g01.r1 Stratagene fibroblast (937 6 2 

319840 C19035 Hs 164259 ESTs 3.3 

65 319977 AA534222 gb:nj21d02.s1 NCLCGAPJVA1 Homo sapiens 4.3 

320074 AA321166 Hs 278233 ESTs 3.4 

320167 AA984373 Hs 90790 Homo sapiens cDNA: FLJ22930 fis, clone K 4.1 
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320167 
320211 
320416 
320588 
320635 
320654 
320742 
320832 
320915 
321016 
321107 
321171 
321253 
321318 
321642 
321644 
321683 
321758 
321811 
321828 
321910 
321937 
321978 
322035 
322136 
322258 
322296 
322303 
322476 
322520 
322521 
322567 
322595 
322675 
322766 
322818 
322882 
322975 
323091 
323131 
323168 
323244 
323262 
323332 
323333 
323335 
323645 
323663 
323693 
323782 
323817 
323930 
323974 
324001 
324036 
324261 
324285 
324296 
3243C5 
324432 
324585 
324598 
324603 
324631 
324716 
324748 
324771 



T99949 
AL039402 
AI026984 
U7B082 
N50617 
AI160015 
AI601188 
AA214584 
AI359144 
BE144167 
AI732643 
AI769410 
AA610649 
AB033041 
AI432199 
AW975944 
AI471598 
U29112 
D80630 
R59890 
H67065 
AL049351 
N77342 
AL137517 
AF075083 
BE265745 
W76326 
AI357412 
AW963372 
T55958 
AF147347 
AF155108 
W92147 
AA017656 
AW068805 
AW043782 
AW248508 
C16391 
AI902456 
AK002088 
AL120862 
AW675572 
AL133990 
AI829520 
AV651680 
AI655499 
AW445014 
BE081058 
AA317962 
AW961560 
AA410943 
AL043683 
AI825204 
AL044949 
AI472078 
BE069341 
AM31159 
AI524039 
AA642007 
AA464510 
AI823969 
AW972227 
AW993522 
M937116 
BE169746 
AW974941 
AA631739 



Homo sapiens cDNA FLJ14832 fis, clone OV 
DEME-6 protein 
ESTs 

RMA polymerase II transcriptional regula 
small nuclear ribonucleopratein polypept 
ESTs 
ESTs 
ESTs 

Homo sapiens cDNA; FLJ23031 fis, done L 
hypothetical protein similar to RNA-bind 
ESTs 
ESTs 
ESTs 

vang (van gogh, Drosophi1a)-l1ke 2 
ESTs 
ESTs 
ESTs 
ESTs 

gb*JHUMO91D02B Human fetal brain (TFupwa 
nuclear receptor subfamily 1, group I, m 
ESTs, Weakly similar to ALU7.HUMAN ALU S 
Homo sapiens mRNA; cDMA DKFZp566C093 (fr 
Homo sapiens cDNA FL) 12900 fis, clone NT 
hypothetical protein DKFZp56401278 
gb:Homo sapiens full length insert cDNA 
ESTs, Weakly similar to ALUCJHUMAN 
gb:zd60d04.r1 SoaresJetaLheart_NbHH19W 
ESTs 

PRO2000 protein 

gb:yb35f05.r1 Stratagene fetal spleen (9 
gb'Homo sapiens full length insert cDKA 
Homo sapiens, Similar to RIKEN cONA 2810 
ESTs 

gb:ze39h01.r1 Soares retina N2b4HR Homo 
Homo sapiens cDNA FU12280 fis, done MA 
ESTs 

Homo sapiens cDNA FLJ14035 fis, done HE 
gb:C 16391 Clontecn human aorta poIyA mRN 
ESTs, Weakly similar to 138022 hypothetl 
Homo sapiens cDNA FU1 1226 fts, clone PL 
programmed cell death 9 (PDCD9) 
ESTs 
ESTs 

gb:wl19c06.x1 NCL_CGAP_Ut1 Homo sapiens 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs, Moderately similar to PC4259 fern 
ESTs 
BMP-R1B 

hypothetical protein FLJ 10803 
ESTs 
ESTs 
ESTs 

gb:QV3-BT0381-270100-073-c08 BT0381 Homo 
Hs.122954 ESTs 
Hs. 192524 ESTs 
Hs,116369 ESTs 
Hs. 152812 ESTs 
Hs.132678 ESTs 

Hs.163986 Homo sapiens cDNA: FU22765 6s, clone K 
Hs.292934 ESTs 

Hs.293683 ESTs, Weakly similar to I54374 gene NF2 
Hs. 1 2504 likely ortholog of mouse Arkadia 
Hs.292385 ESTs, Weakly similar to I78885 sehne/th 
Hs.335440 EST 
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Hs.303428 
Hs.125783 
Hs.293662 
Hs. 167738 
Hs.80506 
Hs.118112 
Hs.120910 
Hs.290167 
Hs.143688 
Hs.49994 
Hs.144151 
Hs.221461 
Hs.333239 
Hs.137507 
Hs.247084 
Hs.237396 
Hs.197531 
Hs.196151 

Hs.83623 
Hs.271530 
Hs.302058 
Hs.21851 
Hs.334473 

Hs.194359 

Hs.1 57601 
Hs.46677 



Hs.256150 
Hs.118394 

Hs.288467 
Ha293616 
Hs,279727 

Hs 210761 
Hs 270124 
Hs 124165 
Hs 193620 
Hs.190642 

Hs.208558 
Hs.161712 
Hs.197746 
Hs.243023 
Hs.249721 
Hs.97600 

Hs.8173 
Hs.211408 
Hs.116298 
Hs.303662 



5.3 
92 
31 
31 
61 
35 
3 

3.7 

3.1 

3.3 

12.3 

3.3 

3 

3.9 
3 

11.7 

3.8 

4,4 

3.2 

3.1 

4.7 

3.5 

5 

19 

3.6 

3 

4.4 
11.5 
3 
3 

4.2 
4 

5.4 

3.1 

5.2 

7.6 

5.9 

16.5 

4 

3.3 

6.3 

4.6 

10.5 

6.2 

4.3 

9.2 

3.1 

4 

3 

3.2 

8.4 

3.3 

4.5 

4.5 

8.4 

49.4 

3 

3 

3.3 
16.5 
3.3 
5 

10.4 

3.3 

3.2 

3 

3 
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324774 AI031771 Hs.132586 ESTs 4.2 

324823 AW516704 Hs.208726 ESTs 3.4 

324824 AI826999 Hs.224624 ESTs 3.1 
324826 AA7048Q6 Hs.143842 ESTs, Weakly similar to 2004399A chromos 4.4 

5 324961 AA613792 gb:no97h03.s1 NC!_CGAP_Pr2 Homo sapiens 3.9 

324987 AI375572 Hs. 172634 ESTs 18.8 

324994 AI805416 Hs.213897 ESTs 3.3 

325146 AI064690 Hs.171176 ESTs 4.2 

325372 Phase 2 & 3 Exons 4.4 

10 325544 Phase 2 & 3 Exons 5.7 

327075 Phase 2 & 3 Exons 3.8 

332798 C22000007:gi|t2314195|ernb|CAB99338.1 1 (A 4.3 

334223 NM_005080*:Homo sapiens X-box binding pr 26.2 

334447 NMJ)12429*:Homo sapiens SEC14 (S. cerevi 3.9 

15 335809 NM_014509*:Homosapienskraken-iike(BK1 10,1 

335824 ENSPO0OO0249072*:DJ222E13.1 (N-TERMINAL 20 

338255 NM_014323*:Homo sapiens zinc finger prot 9 

409430 R21945 Hs.166975 splicing factor, a rginine/serine-rich 5 4 

428046 AW812795 Hs.155381 ESTs, Moderately similar to 1 380 22 hypot 4.6 

20 432558 R97268 Hs.177269 ESTs 3.2 

436808 AA731602 Hs.120266 ESTs 3.9 

448569 BE382657 Hs.21486 signal transducer and activator of trans 4.1 

453542 AW836724 Hs.339660 Homo sapiens mRNA expressed only in plac 3.7 

M97935 AFFX control: STAT1 3.2 

25 M97935 AFFX control: STAT1 3 

M55150 fumarylacetoacetate 3 

M13755 interferon stimulated protein; 15 kDa 4.5 

AI052047 ESTs 6.7 

AA252033 ESTs; Weakly similar to !!!! ALU SUBFAMILY J 3.2 

30 AA401739 ESTs 3.3 

H18459 hepatocellular carcinoma associated protein; 3 

R48744 ESTs 4.2 

M31682 inhibin; beta B (activin AB beta polypeptide) 3 

AA4 16873 ESTs 3 

35 D80240 HUM5G11 A Human fetal brain (TFujiwara) Homo 4 

R49590 ESTs 3.2 

CH22_FGEMES.678_5 16.8 

CH22_FGENES.619_7 12.9 

CH22_FGENES.619_12 11.3 

40 CH22_EM:AC005500,GENSCAN.127 9 9.2 

CH22_EM:AC0O55O0,GENSCAN.304 2 8.5 

CH22_FGENES.271_8 8.4 

CH22_FGENES,619_13 8 

CH22_FGENES.271_7 7.3 

45 CH22_FGENES.617_7 7.2 

CH.07_hsgi|6004473 7.1 

CH22_FGENES,264_1 6.8 
X03363 HER2 receptor tyrosine kinase (c erbB 2; ERBB2; neu) 6,6 

CH22_FGENES.617_9 6.5 

50 CH.07_hsgi|5868264 5.8 

CH.19_hsgi|5867439 5.7 

CH22_FGENES,6 3 5.3 

CH.17_hsgi[5867230 5.1 

CH.20Jisgi|6552458 5.1 

55 CH22_EMAC005500.GENSCAN.148 22 4.7 

CH22_FGENES.669_10 4.6 

AA034918 K1AA1028 protein 4.6 

CH22_FGENES.48J2 4.5 

CH22_FGENES.118_2 4 5 

60 AF049569 ESTs 4.4 

M13955 multiple UniGene matches 4,3 

CH22.FGENES.619J 4.3 

CH22„FGENES.13 7 4.3 

HG4126 HT4396 Zinc Finger Protein Hzf4 4.3 

65 CH22_FGENES.360_3 4.3 

CH22_FGENES.706_9 4.3 

CH.2l_hsgi|6531965 4.2 
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15 



20 



25 



30 



35 
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45 



HG2614HT2710 
X83535 

HG4716HT5158 
HG4677HT5102 



AA714311 



HG2465 HT4871 



AA707750 



AA976074 



CH.17_hsgi|5867215 
CH22_FGENES.669_8 

CH22_FGENES,48J8 

matrix metalloproteinase 14 (membrane inserted) 
CH22_FGENES.271_6 
CH22_FGENES.617_3 
CH22_FGENES.290_8 

CH22.FGENES.13 5 
CH22.FGENES.13 2 
CH.14_hsgi|6682474 
CH.02_hsgi(5867750 
CH22„FGENES.617_8 

CH22_DJ32I10.GENSCAN.23 39 

CH22_FGENES.543_20 

CH22„EM:AC005500.GENSCAN.96 1 

CH22_FGENES.204_2 

CH22_FGENES.619_4 

CH.16_hsgit5867087 

EST cluster (not in UniGene) 

CH22_EM:AC005500.GENSCAN.149 9 

CH22_EM;AC005500.GENSCAN.421 5 

CH22JGENES.13 4 

CH.07_hsgi|6004478 

CH22_FGENES.360_1 

CH22_FGENES.6_2 

CH2Z_C20H12.GENSCAhl.16 2 

CH22_C65E1.GENSCAN.8 1 

ESTs; Weakly similar to cis Golgi matrix 

CH22_FGENES.307_4 

CH22_EMAC005500.GENSCAN.248 14 

CH.06_hs gi|5902482 

CH22_FGENES.669_5 

CH22_DJ32I10.GENSCAN.19 8 

CH22_FGENES.527_6 

CH22_FGENES.330_10 

CH22_FGENES.14 2 

ESTs 

CH22.FGENES.226 7 
CH22_FGENES.13 3 
CH22_EM:AC005500.GEN SCAN .209 12 
CH22_FGENES.271_3 
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4.1 
4.1 

Collagen, Type Viu, Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanosine 5' Monophosphate Synthase 3.8 
3.8 
3.8 
3.8 
3.8 
3.7 

Oncogene Ret/Ptc2, Fusion Activated 
3.7 
3.7 
3.7 
3.5 
3.5 
3.5 
3.4 
3.4 
3.4 
3.3 
3.3 
3.3 

Dna Binding Protein Ap 2, Alt. Splice 3 
3.3 
3.2 
3.2 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3 
3 
3 
3 
3 



3.3 
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TABLE 17A 



10 



Table 17A shows the accession numbers for those pkeys lacking unigenelD's for Table 17. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



15 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession; Genbank accession numbers 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey CAT number Accession 

116845 393481.1 AA649530 AA659316 H64973 
103207 30635,-4 X72790 

126257 182217J N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 

102791 37186J AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 M463992 AW206802 AI970376 AI583718 AI672574 
N25695 AW665466 AI818326 AA126128 AI480345 AW013827 AA248638 Ai2 14968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 AI080480 AI631703 AI651023 A1867418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE466611 AI206344 AA574397 
AA348354 AI493192 

126872 142696.1 AW450979 AA136653 AA 1 366 56 AW4 19381 AA984358 AA4 92073 BE 168945 AA809054 AW238038 BE011212 BE011359 
BE01 1 367 BE01 1368 BE011362 BE011215 BE011365 BE011363 
1746257.1 R82040R70934 
176835J AA225084 AA302713 

324239.1 AI311928 AA936030 T51931 AA609816 AA487195 AA664207 
genbank_F09609 F09609 
genbank_R08440 R08440 



112631 
120742 
106864 
109700 
111532 
113938 
113947 
124357 
108733 
112303 
322136 
322296 
321811 
314648 
322520 
322521 
322675 
323332 
316186 
322975 
324261 
323817 
301976 
324961 
303642 
303797 
319551 
311935 
319834 
319977 
314138 
313591 



genbank_W81598 W81598 

genbank_W84768 W84768 

genbank_N22401 N22401 

504187J AA121022 AA126422 

genbanK.R54797 R54797 

46802J AF075083 H52291 H52528 

47334J W76326 AF086341 W72300 

1527481.1 D80630 D80896 D80895 

293660.1 AW979268 AA878419 AA431 342 AA431628 

38916.1 T55958 T57205 AF147346 

38917.1 AF147347 T55426 T55503 

86787_1 AA017656 AA017374 AA019761 

179142.1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315 
425440J AI433540 AA728984 AA804981 
1510563.1 C16391 C16413 

273265J BE069341 AW748403 AL044891 AI908240 AA393080 

233566.1 AA410943 AW948953 AA334202 AA332882 

128835 J T97905AA101672 

376239.1 AA613792 AW182329 T05304 AW858385 

284260J AW299459AA417112 

386364.1 AW629759 AW749955 AA633408 AI65 1005 

357371 J AA761668 AA573621 R92814 R09670 

174129.1 AA216387 T63548AA228676 

1 12523.1 AA071267 T65940 T64515 AA071334 

345248,1 AA534222 AA632632 T81 234 

179960 J AA740616 AA654854 AA229923 

103087.1 AA046309 AI2635O0 AA046397 
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308106 A1476803 

338255 CH22_6856FG_LINK_EM:AC00 

335809 CH22 3181FG 617_6_L1NK_EM 

335824 CH22_3197FG_619J1_LINK_E 

307010 AI140014 

307041 AI144243 

305913 AA876109 

305917 M876469 

309574 AW168083 

325372 c12_hs 

325544 c12_hs 

332798 CH22J4FG_6_5_UNK_C4G1.G 
334223 CH22_1507FG_360_4_LINK_EM 
327075 c21_hs 

334447 CH22_1746FG_387_7_LINK_EM 
304782 AA582081 

313434 441798J W92070 AW0 19952 W92053 



277 



WO 02/(>5<>377 



PCT/US02/02242 



TABLE 17B 



Table 17B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 17. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey: Unique number corresponding to an Eos probeset 

1 0 Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et a I., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



15 



Pkey Ref Strand Nt ^position 



334447 Dunham, 
335809 Dunham, 

20 335824 Dunham, 
332798 Dunham, 
334223 Dunham, 
338255 Dunham, 
325372 5866920 

25 325544 6682452 
327075 6531965 



et.al. Plus 
et.al. Plus 
elal. Plus 
et.al. Minus 
et.al. Minus 
et.ai. Minus 
Minus 
Plus 
Plus 



14308764-14308824 

26310772-26310909 

26376860-26376942 

232147-231974 

12734365-12734269 

15242294-15242231 

1117061-1117304 

171228-171286 

40413184041431 



278 



WO 02/059377 



PC T/l S02/02242 



TABLE 18: Table 2 from BRCA 014 P 

Table 18 shows genes with atleast five times the expression in breast tumor tissue than is 
5 expressed in normal body tissues. 



Pkey; Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

Rt : Ratio of tumo r to normal body tissue 



15 Pkey ExAccn UnigenelD Unigene Title 



20 



25 



30 



35 



40 



45 



50 



55 



60 



101378 
101530 
101767 
101878 
103010 
104115 
104825 
107105 
108819 
112287 
112561 
112637 
113206 
113970 
114965 
118925 
119905 
121611 
129301 
133976 
134731 
300254 
301884 
302001 
302067 
302276 
302290 
302372 
302385 
309177 
309583 
310781 
311166 
311935 
312153 
313328 
313915 
314097 
314138 
314506 
314558 
314691 
315006 
315021 
315051 
315060 



BE563085 
M29874 
M81057 
M97815 
X52509 
AF183810 
AA035613 
AW963419 
AA011449 
AB033064 
AI791493 
R82331 
BE262470 
W27249 
AI733881 
N92293 
AW449064 
M31669 
AF182277 
AI908165 
D89377 
AW183618 
AA312082 
AB020711 
BE542706 
AW057736 
AA179949 
AL1 17406 
AJ224172 
AI951118 
AW170035 
AI380797 
AI821005 
AA216387 
BE261944 
AW449211 
C18863 
AA648744 
AA740616 
AA833655 
AI873274 
AW2O7206 
AI538613 
AA533447 
AW292425 
AA551104 



Hs.833 
Hs.1360 
Hs.1808M 
Hs. 183650 
Hs.161640 
Hs.26102 
Hs.141883 
Hs. 155223 
Hs.271627 
Hs.334806 
Hs.129873 
Hs.164599 
Hs.241471 
Hs.8109 
Hs.72472 
Hs.206832 
Hs. 11 9571 
Hs.1735 
Hs.330780 
Hs. 169946 
Hs.89404 
Hs.55610 
Hs. 105445 
Hs.278346 
Hs.222399 
Hs.323910 
Hs.175563 
Hs.200102 
Hs.204096 
Hs.326736 
Hs.326736 
Hs.158992 
Hs.118599 



Hs.1 18625 
Hs.105445 
Hs.163443 
Hs.289493 

Hs.206868 
Hs.190721 
Hs.136319 
Hs,298241 
Hs.312989 
Hs. 163484 
Hs. 189048 



interferon-stimuiated protein, 15 kDa 
cytochrome P450, subfamily MB (phenobar 
carboxypeptidase B1 (tissue) 
cellular retinoic acid-binding protein 2 
tyrosine aminotransferase 
opposite strand to trichorhinophalangeai 
ESTs 

stanntocalcin 2 
ESTs 

K1AA1238 protein 

ESTs, Weakly similar to A36036 cytochrom 
ESTs 
RNB6 

hypothetical protein FLJ21080 
BMP-R1B 

ESTs, Moderately similar to ALU8_HUMAN A 
collagen, type III, alpha 1 (Ehlers-Danl 
inhibin, beta B (activin AB beta polypep 
cytochrome P450, subfamily IIB (phenobar 
GATA-binding protein 3 (T-ceR receptor 
msh (DrosophHa) homeo box homolog 2 
solute carrier family 30 (zinc transport 
GDNF family receptor alpha 1 
KIAA09O4 protein 
CEGP1 protein 

HER2 receptor tyrosine kinase (c-erb-b2, 
Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1 
ATP-binding cassette transporter MRP8 6.7 
Hpophilin B (uteroglobin family member) 13.8 
Homo sapiens breast cancer antigen NY-BR 17.3 
Homo sapiens breast cancer antigen NY-BR 57.6 
ESTs 102 
ESTs 10.8 
gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 5.2 
hexo kinase 1 5.2 
GDNF family receptor alpha 1 1 2.4 

Homo sapiens cDNA FIJ1 1 576 fis, clone HE 26.3 
ESTs 6.6 
gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 5.9 
Homo sapiens cDNA FU14056 fis, done HE 8.5 
ESTs 27.4 
ESTs 20-7 
Transmembrane protease, serine 3 10.9 
ESTs 5.3 
ESTs 12.9 
ESTs, Moderately similar to ALUCJHUMAN I 5.8 
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R1 

5.3 
9 
12 
6.5 
12.4 
7.6 
6.9 
5.3 
6.1 
7.3 
8.2 
5.4 
6.2 
6.9 
10.1 
19,7 
8.4 
5.6 
6.2 
6.2 
5.8 
9.9 
5.7 
77 
7.3 
5.4 



WO 02/059377 



PCT/DS02/02242 



315196 AI367347 Hs.44898 

315530 AW015415 Hs.127780 

315634 AA837085 Hs.220585 

315012 AA764950 Hs.119898 

316177 AI904982 Hs.293102 

316580 AA938198 Hs.146123 

317803 AW664964 Hs.128899 

317881 AI827248 Hs.224398 

318740 NM_002543Hs,77729 

318744 AI793124 Hs.144479 

320211 AL039402 Hs.125783 

321107 AI732643 Hs.144151 

321644 AW975944 Hs.237396 

321978 N77342 Hs.21851 

322035 AL137517 Hs.334473 

322766 AW068805 Hs.288467 

322818 AW043782 Hs.293616 

322975 C16391 

323262 AL133990 Hs.190642 

323332 AI829520 

323817 AA410943 

324261 BE069341 

324432 AA464510 Hs.152812 

324598 AW972227 Hs.163986 

324603 AW993522 Hs.292934 

324987 AI375572 Hs. 172634 
325544 

330388 Hs.46 

334223 

335809 

335824 

AI052047 
R72427 



Homo sapiens clone TCCCTA00151 mRNA sequ8.2 



ESTs 8.9 

ESTs 6.3 

ESTs 7 

ESTs, Moderately similar to ALULHUMAN A 307 

poty(A) polymerase gamma 9.4 

ESTs 6.1 

Homo sapiens cDNA FLJ 1 1469 fis, clone HE 9.6 

oxidised low density lipoprotein (lectin 7.3 

ESTs 17.8 

DEME-6 protein 9.2 

ESTs 12.3 

ESTs 117 

Homo sapiens cDNA FLJ 1 2900 fts, clone NT 5 

hypothetical protein DKFZp56401278 19 

Homo sapiens cDNA FLJ 1 2280 fis, clone MA 5.2 

ESTs 7.6 

gb:C 16391 Clontech human aorta polyA mRN 1 6.5 

ESTs 10.5 

gb:wl19c06.x1 NCt_CGAP_Ut1 Homo sapiens 6.2 

BMP-R1B 8.4 
gb:QV3-BT0381-270100-073^c08 BT0381 Homo 49.4 

ESTs 16.5 

Homo sapiens cDNA; FLJ22765 fis, cione K 5 

ESTs 10.4 

ESTs 18.8 

Phase 2 & 3 Exons 5.7 

HER2 receptor tyrosine kinase {c-efb-b2, 6.6 

NM_005080*:Homo sapiens X-box binding pr 26.2 

NM_01 4509*: Homo sapiens kraken-like (BK1 10.1 
ENSP00000249072*:DJ222E13.1 (N-TERMINAL 20 

ESTs; Weakly similar to CYTOCHROME P450 67 

CH22,ErrtAC005500.GENSCAN.127 9 5.5 

CH22_FGENES.619_13 9.2 

CH22_FGENES.617_9 8 

CH22.FGENES.27W 6.5 

CH22_FGENES.619_7 7.3 

CH22_FGENES.271_8 12.9 

CH22_FGENES.619_12 8.4 

CH22„EM:AC005500.GENSCAN,304 2 11.3 

CH.07jisgi|60O4473 8.5 

CH22_FGENES.617_7 7.1 

CH22_FGENES.678_5 7.2 

CH22JGENES.678_5 16.8 



280 



WO 02/H59377 



PCT/IS02/02242 



TABLE 18A 



10 



Table 18A shows the accession numbers for those pkeys lacking unigenelD's for Table 18. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



15 



20 



25 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

323332 179142.1 AI82S520 AI791B32 AA228414 AI791823 AA229211 AA229315 

322975 1510563J C16391 C15413 

324261 273265_1 BE069341 AW748403 ALQ44891 AI908240 AA393080 

323817 233566.1 AA410943 AW948953 AA334202 AA332882 

311935 174129.1 AA2 15387 T63548 AA228676 

314138 179960.1 AA740616 AA654854 AA229923 

335809 CH22_3181FG_617_6JJNK_EM 

335824 CH22J197FG_619J1_UNK_E 

325544 c12 hs 

334223 CH22_1507FG_360J_UNK_EM 
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TABLE 18B 

Table 18B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 1 8. For each predicted exon, we have listed the genomic 

sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

1 0 Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I et a!.' refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham I. et al. Nature (1999) 402:489495. 
Strand : Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



15 



Pkey Ref St rand Nt_positio n 



335809 Dunham, L etal. Plus 26310772-26310909 

20 335824 Dunham, L et.al. Plus 26376860-26376942 

334223 Dunham, I. etal. Minus 12734365-12734269 

325544 6682452 Plus 171228-171286 
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 

5 Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues. 
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such 
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or 
equal to 2.5. The "average" breast cancer level was set to the 90 :h percentile value. The 
"average" normal adult tissue level was set to the 90 th percentile value amongst 144 non- 

1 0 malignant tissues. In order to remove gene-specific background levels of non-specific 

hybridization, the 15 th percentile value amongst the 144 non-malignant tissues was subtracted 
from both the numerator and the denominator before the ratio was evaluated. 
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45 



50 



55 



Pkey; Unique Eos proceset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 



Pkey ExAccn UnigenelD UnigeneTitle 



R1 



408591 

406964 

400291 

407277 

449746 

426878 

400292 

427585 

408045 

407178 

407377 

450705 

407212 

428848 

404561 

407980 

447350 

450375 

422109 

435496 

453160 

420813 

415989 

422505 

424399 

423575 

429441 

431474 

448595 

427217 

402578 

422805 

424634 

456207 

424086 

459587 



AF015224 
M21305 
AA401369 
AW1 70035 
AI668594 
BE069341 
AA250737 
D31152 
AW138959 
AA195651 
C16391 
U90304 
AA412108 
NM_000230 

AA046309 

AI375572 

AA0O9647 

S73265 

AW840171 

AI263307 

X51501 

AI267700 

AL120862 

AI905687 

C18863 

AJ224172 

AL1 33990 

AB014544 

AA399272 

AA436989 

NMJXJ3613 

AA193450 

AI351010 

AA031956 



Hs.46452 mammagbbin 1 137.6 
gb '.Human alpha satellite and satellite 3 71 .0 

ESTs 68.4 
Homo sapiens breast cancer antigen NY -BR 54.2 
ESTs, Weakly similar to CP4Y_HUMAN CYTOC 46.4 
gb:QV3-BT0381-270100-073-c08BT0381 Homo 44.8 



Hs.190721 
Hs.326736 
Hs.176588 



Hs.72472 BMP-R1B 
Hs.179729 collagen, type X, alpha 1 (Schmid metaph 
Hs.245123 ESTs 
Hs.104106 ESTs 

gbC16391 Clontech human aorta potyA mRN 
Hs.25351 iroquois homeobox protein 5 
Hs,269350 ESTs 

Hs.194236 teptin (murine obesity homolog) 

trichortiinophalangeal syndrome I (TRPS1) 
gb:zf12f01.s1 SoaresJetaJ_hearLNbHH19W 

Hs. 172634 ESTs 

Hs.8850 a disintegrin and metalloproteinase doma 
Hs.1473 gastrin-re leasing peptide 
Hs.265398 ESTs, Weakly similar to transformations 
Hs.239884 H2B histone family, member L 
Hs.99949 proiactin-induced protein 
Hs.317584 ESTs 

Hs. 124 165 programmed cell death 9 (PDCD9) 
Hs.2533 aldehyde dehydrogenase 9 family, member 
Hs.163443 Homo sapiens cONA aJ11576 fis, clone HE 
Hs.204096 lipophilin B (uteroglobin family member) 
Hs.190642 ESTs 

Hs.21572 KIAA0644 gene product 
Hs.144341 ESTs 

C1001134;gil2117372lp:r|ll65981 fatty ac 
Hs.121017 H2A histone family, member A 
Hs.1 51407 cartilage intermediate layer protein, nu 

gb;zr40e07.r1 Soares_NhHMPu„S1 Homo sapi 
Hs. 102267 rysyl oxidase 

gb:zk15e04.s1 Soares„pregnantjJterus_NbH 
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37.4 
32.9 
31.9 
30.4 
27.7 
24.8 
22.0 
21.9 
21.8 
19.8 
17.3 
16.5 
16.5 
16.0 
15.8 
15.8 
15.5 
14.8 
14.5 
13.7 
13.6 
13 5 
(3.G 
12.8 
126 
12.2 
12.0 
11.9 
11.9 
11.5 
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416208 AW291168 Hs.41295 ESTs, Weakly simitar to MUC2_HUMAN MUCIN 11.5 

429170 NM.001394 Hs.2359 dual specificity phosphatase 4 11.5 

407276 AI951118 Hs 326736 Homo sapiens breast cancer antigen NY-BR 11 4 

434377 AW1 37148 Hs 306593 Homo sapiens cDNA FLJ 11382 fis. clone HE 113 

5 448390 AL035414 Hs 21068 hypothetical protein 111 

452401 NMJJ07115 Hs 29352 tumor necrosis factor, alpha -induced pro 11.0 

421037 AI684808 Hs 197653 programmed cell death 9 (PDCD9) 10 9 

452461 N78223 Hs.108106 transcription factor 10.7 

443348 AW873596 Hs 182278 calmodulin 2 (phosphorylase kinase, delt 10 6 

10 421155 H87879 Hs 102267 lysyl oxidase 10 5 

402606 NM_G24626:Homo sapiens hypothetical prot 10.4 

409269 AA576953 Hs.22972 hypothetical protein FLJ 13352 10.4 

447268 AI370413 Hs 36563 hypothetical protein FLJ224 18 10.3 

447033 AI357412 Hs 157601 ESTs 10.2 

15 400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1 

424905 NM 002497 Hs.153704 NIMA (never in mitosis gene a)-related k 10.1 

432441 AW292425 Hs.163484 ESTs 9.9 

427365 AI873274 Hs 190721 ESTs 9,9 

438950 H23789 Hs.144530 EST 9.8 

20 422835 BE218705 Hs.121378 metaBothionein-like 5, tesUs-specific 9.7 

425692 D90041 Hs 155956 N-acetyttransf erase 1 (arylamine N-acety 9.7 

411869 W20027 Hs 23439 ESTs 9.6 

439820 AL360204 Hs 283853 Homo sapiens mRN A full length insert cDN 9.6 

445730 AI624342 Hs 170042 ESTs 95 

25 459583 AI907673 g b: I L-6T1 52 -080 399-004 BT152 Homo sapien 9 3 

452744 AI267652 Hs 30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 9 2 

432596 AJ224741 Hs 278461 matrilin3 9.1 

400297 AI127076 Hs 334473 hypothetical protein DKFZp5640 1278 9 1 

449448 D60730 Hs57471 ESTs 91 

3 0 423945 AA410943 gb:zt32h03.r1 So a res ovary tumor NbHOT H 9 1 

406348 Target Exon 9 0 

424735 U31875 Hs. 272499 short-chain alcohol dehydrogenase family 9.0 

453392 U23752 Hs 32964 SRY (sex determining region Y) -box 11 9 0 

433365 AF026944 Hs 293797 ESTs 8 8 

35 405654 NA C12001521:gi|7513934|pirl|T31081 cca3 pr 8 8 

418601 AA279490 Hs.86368 calmegin 8 8 

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 8 7 

408771 AW732573 Hs 47584 potassium voltage-gated channel, delayed 8.5 

409041 AB033025 Hs 50081 KIAA1 199 protein 84 

40 442353 BE379594 Hs 49136 ESTs, Moderately similar to ALU7_HUMAN A 8 3 

451561 N52812 Hs 177403 ESTs 8 2 

424001 W67883 Hs 137476 paternally expressed 10 8.2 

429859 NM_0Q7Q50 H s. 225952 protein tyrosine phosphatase, receptor t 8.1 

423887 AL080207 Hs 134585 DKFZP434G 2 32 protein 8.1 

45 405095 NA Target Exon 81 

419296 AA236115 Hs.120785 ESTs 8.0 

447164 AF026941 Hs 17518 Homo sapiens cig 5 mRNA, partial sequence 8.0 

413472 BE242870 Hs.75379 solute carrier family 1 (glial high affi 8.0 

416747 AW876523 Hs.15929 hypothetical protein RJ 12910 8 0 

50 415385 R17798 Hs7535 CO BW- like protein 7.9 

434424 AI811202 Hs 325335 Homo sapiens cDNA; FU23523 fis, done L 7 9 

420931 AF044197 Hs 100431 small inducible cytokine B subfamily (Cy 7.9 

406687 M31126 Hs 272620 matrix meta tlo proteinase 1 1 (MMP11; stro 7 8 

400285 NA Eos Control 7 7 

55 437207 T27503 Hs 15929 hypothetical protein FLJ 12910 7 6 

427119 AW880562 Hs.114574 ESTs 7 5 

429534 AW976987 Hs 1 63327 ESTs, Weakly similar to 21 09260A B cell 7 5 

433426 H69125 Hs 133525 ESTs 7 5 

411078 AI222020 Hs 182364 CocoaCrisp 7 4 

60 426214 H 59846 Hs 128355 ESTs, Moderately similar to ALU 7_HUMAN A 7 4 

447475 AI380797 Hs 158992 ESTs 7 3 

415263 AA948033 Hs 130853 ESTs 7 2 

439569 AW602166 Hs 222399 CEGP1 protein 7 2 

414142 AW368397 Hs 150042 Homo sapiens cONA FLJ14438 fis, clone HE 7 1 

65 426261 AW242243 Hs. 168670 peroxisomal f am esyla ted protein 7 0 

444783 AK001468 Hs 62180 anilfin (Drosophila Scraps homolog), act 6 9 

445885 AI7340O9 Hs.127699 KIAA1603 protein 6 9 
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429432 AI678059 
410781 AI375672 
443788 AI732543 
421373 AA808229 
451398 AI793124 
404253 

441096 AI015591 
426215 AW963419 
428227 AA321649 
422956 BE545072 
411111 AW818127 
434988 AJ418055 
442580 A1733682 
449611 AIS70394 
408000 L11690 
420757 X78592 
431089 BE041395 
400301 X03635 
427356 AW023482 
425704 U79293 
441134 W29092 
424902 NM_003866 
448693 AW004854 
431448 AL1 37517 
444342 NM_014398 
422168 AA586894 
453331 A1240665 
418007 M13509 
441233 AA972965 
418092 R45154 
430044 AM64510 
432837 AA310693 
433285 AW975944 
450701 H39960 
425707 AF1 15402 
410785 AW803341 
425398 AL049689 
414812 X72755 
459371 R20991 
411284 N28519 
453511 AL031224 
451807 W52854 
430510 AW162916 
415539 AI733881 
438199 AW016531 
417866 AW067903 
430019 AA463893 
439809 R41396 
423811 AW299598 
434539 AW748078 
439133 AI742605 
453931 AL121278 
444078 BE246919 
447102 BE167434 
451621 AI879148 
425236 AW067800 
421464 AA291553 
450736 AW970O60 
428C85 AA421081 
452838 U65011 
445424 AB028945 
456938 X52509 
422867 L32137 
438167 R28363 
433330 AW207084 
449765 N92293 
416276 U41060 



Hs-202676 
Hs.165028 
Hs.144151 
Hs.1 67771 
Hs.144479 

Hs.131004 
Hs.155223 
Hs.2248 
Hs.122579 

Hs.161160 
Hs. 130239 
Hs. 197075 
Hs.620 
Hs.99915 
Hs.283676 
Hs.1657 
Hs.97849 
Hs. 159264 
Hs.7678 
Hs.153567 
Hs.228320 
Hs,334473 
Hs. 10887 
Hs, 11 2408 
Hs,8895 
Hs,83169 
Hs.135568 
Hs.106604 
Hs.152812 
Hs.87329 
Hs.237396 
Hs.288467 
Hs.11713 

Hs.156369 
Hs.77367 



synaptonemal complex protein 2 6.9 

ESTs 6.9 

ESTs 6.9 

ESTs 6.8 

ESTs 6.8 
NM_021058*-JHomo sapiens H2B histone fami 6.8 

ESTs, Weakty similar to T1 7227 hypotheti 6.7 

stanniocalcin 2 6.6 

small inducible cytokine subfamily B (CX 6.6 

hypothetical protein FL J 10461 6.6 



gb:CM1-ST0277-O61299-059-b07 ST0277 Homo 6.6 



Hs,135191 
Hs.33102 
Hs.27099 
Hs,241576 
Hs.72472 
Hs.122147 
Hs.82772 
Hs,220933 
Hs.101774 
Hs.50895 
Hs.214410 
Hs.193696 
Hs.25144 
Hs.10290 
Hs.98471 
Hs.26770 
Hs.155223 
Hs. 190086 

Hs. 12388 

Hs.30743 

Hs.12696 

Hs.161640 

Hs.1584 

Hs.24286 

Hs.132816 

Hs.206832 

Hs.79136 



ESTs 
ESTs 
ESTs 

bulious pemphigoid antigen 1 (230/240kD) 
androgen receptor {dihydrotestosterone r 
ESTs, Weakly similar to unknown protein 
estrogen receptor 1 
ESTs 

Human clone 23948 mRMA sequence 
cellular retinoic acid-binding protein 1 
inositol poiyphosrjhate-4-phosphatase, ty 
hypothetical protein FLJ23537 
hypothetical protein DKFZd56401278 
similar to fysosome-associated membrane 
S100 calcium-binding protein A7 (psorias 
ESTs 

matrix metalloproteinase 1 (MMP1; inters 
ESTs 
ESTs 
ESTs 

HSPC072 protein 
ESTs 

Homo sapiens cDNA FLJ 12280 fe, clone MA 
E74-like factor 5 (ets domain transcript 
gb:IL2-UM007^090300-05OD03 UM0079 Homo 5.9 
hypothetical protein similar to tenascin 5.9 
monokine induced by gamma interferon 5.8 
gb.yg06h01.r1 Scares infant brain 1N1B H 
ESTs, Weakly similar to unnamed protein 
transcription factor AP-2 beta (adivati 
hypothetical protein FU23293 similar to 
hypothetical protein PR02577 
BMP-R1B 
ESTs 

collagen, type XI, alpha 1 
ESTs 

hypothetical protein FU23045 
homeoboxC4 

ESTs, Weakly similar to MUC2JHUMAN MUCIN 5.4 
ESTs 
ESTs 

U5 snRNP-specific 40 kDa protein (hPrp8- 
ESTs, Weakly similar to T1 87 12 hypotheti 
fatty acid binding protein 7, brain 
stanniocalcin 2 
ESTs 



6.6 
6.6 
6.6 
6.5 
6.5 
6.5 
6.5 
6.5 
6.4 
6.4 
6.4 
64 
6 2 
6.1 
6,1 
6.1 
6.0 
6,0 
6,0 
5.9 
5.9 
59 
5.9 
5.9 



5.8 
5,8 
5.8 
5.7 
5.7 
5.6 
5.6 
5.5 
5.5 
5.5 
5.4 



5.4 
5.4 
54 
5.4 
5.4 
5.3 
5.3 



gb:EST382l40 MAGE resequences, MAGK Homo5,3 

ESTs 5.3 

preferentially expressed antigen in mela 5.3 

cortactin SH3 domain-binding protein 5.3 

tyrosine aminotransferase 5.3 

cartilage oligomeric matrix protein (COM 5.2 

ESTs 5.2 

hypothetical protein N1GC14801 5.2 

ESTs, Moderately similar to ALU8„HUMAN A 5.2 

UV-1 protein, estrogen regulated 5.2 
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400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 5.2 

418004 U37519 Hs.87539 a Idehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GDNF family receptor alpha 1 5.2 

428771 AB028992 Hs 193143 KJAA1069 protein 5.2 

5 455047 AW852530 gb;PM1 -CT0243-071 099-00 1-g06 CT0243 Homo 5.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 5.2 

453197 At916269 Hs 109057 ESTs, Weakly similar to ALU5_HUMAN ALU S 5.1 

400298 AA032279 Hs 61635 six trans membrane epithelial antigen of 5.1 

431023 AI283133 Hs 297420 ESTs 5 1 

10 427666 AI791495 Hs 180142 caJmodulin-like skin protein 5.1 

427718 Ar798680 Hs.25933 ESTs 5.1 

434531 AA642007 Hs 116369 ESTs 51 

429220 AW207206 Hs.136319 ESTs 51 

405494 NA C200 1837*:gi|1 2697903 |dbj|BAB2 1770. 1| (A 5.1 

15 452930 AW195285 Hs 194097 ESTs, Weakly similar to I38022 hypotheti 5.1 

444910 AI201849 gb:qs76g04J(1 NCt_CGAP_Pr28 Homo sapiens 5.1 

453310 X70697 Hs553 solute earner family 6 (neurotransmitte 5 0 

444381 BE387335 Hs 283713 ESTs, Weakly similar to S64054 hypotheti 5.0 

450603 R43646 Hs. 12422 ESTs 5 0 

20 416575 W02414 Hs 38383 ESTs 5 0 

438504 AW665281 Hs 224625 ESTs 5.0 

416209 AA236776 Hs 79078 MAD2 (mitotic arrest deficient, yeast, h 5 0 

428804 AK000713 Hs 193736 hypothetical protein FU20706 5 0 

420077 AW512260 Hs 87767 ESTs 4 9 

25 450480 X82125 Hs 25040 zinc finger protein 239 4 9 

437637 AJ003029 Hs 65792 syntrophin, gamma 2 4 9 

431808 M30703 Hs 270833 amphiregu)in(scliwannoma-derived growth 4 9 

418836 AI655499 Hs 161712 ESTs 4 8 

442441 AI820662 Hs 129598 ESTs 4 8 

30 435635 AF220050 Hs 181385 uncharacterized hematopoietic stem/proge 4 8 

400286 NA C 1600 0922 :gi|74991 03 [ pir ||T20903 hypothe 4 8 

407506 U71600 gb;H u man zinc finger protein zfp31 (zf 31 4 8 

420026 AI831190 Hs 166676 ESTs 4 8 

441377 BE218239 Hs.202656 ESTs 4 8 

35 457726 AI217477 Hs 194591 ESTs 4 8 

412785 AW997556 Hs 78521 KIAA1717 protein 4 8 

428368 BE440042 Hs 83326 matrix metalloproteinase 3 (stromelysin 4 7 

436026 AI349764 Hs 217081 ESTs 4 7 

409110 AA191493 Hs48778 niban protein 4 7 

40 400284 NA estrogen receptor 1 4 7 

410102 AW248508 Hs 279727 Homo sapiens cDNA FLJ 14035 fis, clone HE 4 7 

407819 R42185 Hs 274803 ESTs 4 7 

430486 BE062109 Hs 241551 chloride channel, calcium activated, fa m 4 7 

422896 AW961489 Hs 154116 ESTs 4 7 

45 453616 NMJ)03462 Hs 33846 dyne'm, axonemal, light intermediate pol 4 7 

427427 AF077345 Hs 177936 ESTs 4 6 

421751 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 4 6 

454074 R63503 Hs 28419 ESTs 4 6 

405718 C4000799*:gi|6330365|dbj(BAA86508.1| (AB 4 6 

50 444649 AW207523 Hs 197628 ESTs 4 6 

429431 240313 Hs 106330 Homo sapiens done IMAGE;23371, mRNAseq 46 

427811 M81057 Hs. 180884 carboxypeptidaseBI (tissue) 4 6 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RJKEN cDNA 2010 4 6 

430345 AK000282 Hs.239681 hypothetical protein FLJ20275 4 6 

55 454307 AW855717 gb:RC1 -CT0279-O81 299-0 13-b01 CT0279 Homo 4 6 

400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 4 6 

438180 AA808189 Hs.272151 ESTs 4.6 

451340 AW936273 gb;QVO-DT0O2O-O9020O-1O7-g07 DT0020 Homo 4 6 

458711 AL036877 Hs. 282878 ESTs 4,6 

60 457430 AA514660 Hs.128443 ESTs 4 6 

416030 H15261 Hs.21948 ESTs 4,6 

447233 AW246333 Hs.17901 Homo sapiens, clone IMAGE :39370 15, mRNA, 4.6 

445537 AJ245671 Hs.12844 EGF-like-domain, muitipJe 6 (EGFL6) 4 ,5 

424590 AW966399 Hs.46821 hypothetical protein FLJ 20086 4.5 

65 432374 W68815 Hs.301885 Homo sapiens cDNA FU 11 346 fis, clone PL 4.5 

423833 AW503329 gb:UI-HF-BNO-akx-e-02-0-U!.r1 NIH_MGC_50 4.5 

406747 AI925153 Hs.2 17493 annexinA2 4.5 

286 



WO 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



412102 
431716 
411050 
401418 
436194 
436211 
414080 
424115 
415786 
442117 
452784 
432731 
410534 
405196 
430217 
401793 
415747 
423679 
400238 
425627 
400608 
458634 
407771 
405906 
405925 
439382 
445263 
407162 
419536 
454359 
411558 
450715 
421451 
452864 
409757 
413043 
413499 
444619 
408380 
406992 
404285 
425247 
428046 
446163 
421147 
426451 
415227 
452176 
452862 
443646 
425523 
424687 
430009 
434469 
451381 
450229 
455700 
431924 
436S85 
401451 
431676 
409092 
429270 
443903 
427122 
410275 
432912 



H56435 
D89053 
AWB14902 
NA 

AK001074 
AK001581 
AA1 35257 
AA335497 
AW419196 
AW664954 
BE453857 
R31178 
AW905138 
NA 

N47863 



gb:yq98e09.r1 Soares fetal liver spleen 4.5 
Hs 268012 fatty-acid -Coenzyme A ligase, long-chain 4.5 
gb:MR1-ST0206-120400-022-f08 ST0206 Homo 4.5 



4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 



AA381209 
AB007975 
NA 

AF019612 

AV657310 
AL138272 
NA 
NA 

BE247684 
H57646 
N63855 
AA603305 
N71277 
AA102670 
AI2664S4 
AA291377 
AA033714 
NMJJ01898 
BE1 58766 
BE144884 
BE538082 
AF123050 
S82472 
NA 

NM_005940 
AW812795 
AA026880 
AW592167 
A1908165 
AW821113 
AA024538 
AW378065 
AI085198 
AB007948 
JO 5070 
AA894564 
AA634806 
BE241831 
R18717 
BE068115 
AKD00850 
AI836558 

AI685464 
AI735283 
W60379 
AI220547 
AW057735 
U85658 
BE007371 



Ct4000338*:gi{7459502|pirj|S74665 outer 
Hs.333435 Homo saptens cDNA FU10212 fis, clone HE 
Hs 334828 hypothetical protein FU10719; KIAA1794 
Hs. 47783 B aggressive lymphoma gene 
Hs.293965 ESTs, Weakly similar to I38022 hypotheti 
Hs. 257924 hypothetical protein FLJ 13782 
Hs. 128899 ESTs 

Hs.151258 hypothetical protein FU21 062 
Hs.287820 fibronectin 1 

gb:GV0-NN1071-280400-207-g07 NN1071 Homo4.4 
C2000662*:gi|75l2792|pir||T12482 hypothe 4.4 
Hs.336901 ribosomal protein S24 4.4 
C17001545:gi|5360127|gblAAD42882.1lAF155 4.4 
gb:EST94257 Activated T-celb i Homo sap 
Hs.131454 KIAA0506 protein 

C19000274*:gi|1274l327|ref|XP_008833.2| 
Hs 297007 membrane-bound transcription factor prot 

C1000l899:gi|7508633|pir|(T25392 hypothe 
Hs.282898 ESTs 
Hs.62713 ESTs 

Target Exon 
Target Exon 
Hs.103070 ESTs 
Hs.42586 KIAA1560 protein 
Hs.1 42634 zinc finger protein 

gb:np12d11.s1 NCl_CGAP_Pr3 Homo sapiens 
gb:za36e03.s1 Soares fetal Twer spleen 
Hs.70725 gamma-aminobutyric acid (GABA) A recepto 
Hs.31570 ESTs, Weakly similar to KIAA1324 protein 
Hs.50831 ESTs 

Hs. 287629 hypothetical protein FU 14260 
Hs.123114 cystatinSN 

gb IL2-HT0397-071 299-024-F02 HT0397 Homo 4.2 
gb*CM0-HT0182-04109£O65-e11 HT0182 Homo4.2 



4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.2 
4.2 



Hs.8172 ESTs, Moderately similar to A460 10 X-lin 

Hs.44532 diubiquitin 

gb:beta -pol=DNA polymerase beta {exon a 
C60 0 1 909 :g i|70444 1 |dbj|BAA 1 8909. 1 1 (D298 

Hs. 155324 matrix metalloproteinase 1 1 (MMP1 1; stro 

Hs.155381 ESTs, Moderately similar to I38022 hypot 

Hs.25252 prolactin receptor 

Hs.293299 ESTs 

Hs.169946 GATA-binding protein 3 (T-cell receptor 
Hs.72402 ESTs 

Hs.282990 Human DNA sequence from clone RP1-28H20 

Hs.8687 ESTs 

Hs.164226 ESTs 

Hs.1 58244 KIAA0479 protein 

Hs.151738 matrix metalloproteinase 9 (gelatinase B 

Hs.22242 ESTs 

gb:ab28c02.r1 Stratagene lung (937210) H 
Hs.172330 hypothetical protein MGC27Q5 
Hs.8929 hypothetical protein FU11362 

Hs.272203 
Hs.1 84987 



4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 



gb:CM1-BT0368-061299-O6O-g07 BT0368 Homo 4.1 



Hs.1 72608 

Hs.57773 

Hs.1 35223 

Hs.323910 

Hs.61796 

Hs.200313 



Homo sapiens cDNA FU20843 (is, clone AD 
ESTs 

NM_004496*:Homo sapiens hepatocyte nude 
gb:tt88f04.x1 NC l_CG A P„Pr28 Homo sapiens 
ESTs 
ESTs 
ESTs 

HER2 receptor tyrosine kinase (c-erb-b2, 
transcription factor AP-2 gamma (activat 
ESTs 
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403585 Target Exon 4.1 

438295 AI394151 Hs.37932 ESTs 4.1 

420380 M640891 Hs.102406 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic an hyd rase VIII 4.1 

5 416182 NM.004354 Hs.79069 cyclinG2 4.1 

418994 AA296520 Hs.89546 selectin E {endothelial adhesion molecul 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs.58589 glycogenin2 4.0 

427674 NMJJ03528 Hs.2178 H2B histone family, member Q 4.0 

10 427131 AA448460 Hs.1 12017 6E36gene 4.0 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length insert cDN 4.0 

429353 AL 117406 Hs.2001 02 ATP-bmding cassette transporter MRP8 4.0 

421296 NMJ02666 Hs.103253 perilipin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

15 424188 AW954552 Hs.142634 zinc finger protein 4.0 

455431 AW938484 gb:CMO-DT0057-290200-253-d06 DT0057 Homo4.0 

404142 NA Target Exon 4.0 

441143 AI027604 Hs.159650 ESTs 4.0 

444540 AI693927 Hs.265165 ESTs 4.0 

20 415579 AA165232 Hs.222069 ESTs 4.0 

452891 N75582 Hs.212875 ESTs, Weakly similar to DYH9JHUMAN CILIA 4.0 

414605 BE390440 gb:601 283601 F1 NIH_MGC_44 Homo sapiens c 4.0 

452281 T93500 Hs.28792 Homo sapiens cDNAFU 11041 fis, clone PL 4.0 

417801 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like rep 4.0 

25 446232 AI281848 Hs.194691 retinoic acid induced 3 4.0 

447377 X77343 Hs.334334 transcription factor AP-2 alpha (activat 4.0 

437854 AL1 19723 gb:DKFZp761A2124_r1 761 (synonym: hamy2) 4.0 

446140 AA356170 Hs.26750 hypothetical protein FLJ2 1908 4.0 

452240 AI591147 Hs.61232 ESTs 4.0 

3 0 459574 AI741122 Hs. 101810 Homo sapiens cDNA FLJ1 4232 fis, clone NT 4.0 

458673 N99626 gb;za39d1U1 Soa res fetal liver spleen 4.0 

444858 AI199738 Hs.208275 ESTs, Weakly similar to ALUA_HUMAN I!!! 4.0 

452166 AI948607 Hs.264680 ESTs 4.0 

452681 AF153330 Hs.30246 solute carrier family 19 (thiamine trans 3.9 

35 450192 AA263143 Hs.24596 RAD51 -interacting protein 3.9 

406554 NA Target Exon 3.9 

416259 AA573006 Hs,19173 ESTs 3.9 

445813 242023 Hs.106576 alanine-g lyoxy late aminotransferase 2-li 3.9 

451024 AA442176 gb:zw63b08.r1 Soares_total_fetus_Nb2HF8^ 3.9 

40 413930 M86153 Hs.75618 RAB11A, member RAS oncogene family 3.9 

401781 Target Exon 3.9 

415296 F05086 Hs.328142 ESTs 3.9 

452564 AA026777 gb;ze93c11.r1 SoaresJetalJiearLNbHH19W 3.9 

442500 AJ819068 Hs.209122 ESTs 3.9 

45 419759 Z21336 Hs.1 35411 actin related protein 3.9 

424638 AI472106 Hs.49303 Homo sapiens cDNA FU 11 663 fis, clone HE 3.9 

439699 AF0S6534 Hs.187561 ESTs, Moderately similar to ALU INHUMAN A 3.9 

428042 AA419529 Hs.76391 myxovirus (influenza) resistance 1, homo 3.9 

452501 AB037791 Hs.29716 hypothetical protein FLJ 10980 3.9 

5 0 453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sapiens cDNA; FLJ22097 fis, clone H 3.9 

443489 AI073512 Hs.133916 ESTs 3.9 

455092 BE152428 gb:CMO-HT0323-1 51 299-1 26-b04 HT0323 Homo3.9 

401785 NM_002275*;Homo sapiens keratin 15 (KRT1 3.9 

5 5 426427 M86699 Hs.1 69840 TTK protein kinase 3.9 

446009 AI989885 Hs.231926 ESTs 3 9 

436033 H75391 Hs.255748 ESTs 3.9 

451067 BE172186 gb:MRO-HT0559-110300-005-h11 HT0559 Homo3.8 

419348 AA236645 Hs.98274 ESTs 3 8 

60 444635 Al 184268 Hs.339665 ESTs 3.8 

412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 3.8 

403593 NA Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 Hs.87836 Homo sapiens PAC done RP5-1087M1 9 from 3.8 

65 433871 W02410 Hs.205555 ESTs 3.8 

445253 AI217928 Hs.144762 ESTs 3.8 

409542 AA503020 Hs.36563 hypothetical protein FLJ224 18 3.8 
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443162 
458194 
422475 
440705 
447290 
403426 
427821 
454288 
443901 
410658 
410672 
428579 
445495 
447995 
401747 
420633 
423545 
433138 
434715 
428664 
450951 
402696 
446868 
458154 
422026 
419440 
421524 
417283 
401508 
410303 
420362 
433384 
434302 
443938 
448420 
458712 
433404 
405232 
430491 
455609 
450164 
453948 
436061 
401049 
418867 
420179 
458663 
437259 
428309 
450522 
451952 
412209 
425201 
443830 
439255 
414869 
409064 
407721 
445135 
404091 
409731 
405153 
423248 
403639 
404360 
422352 
423338 



T49951 
AW383618 
AL359938 
AA904244 
AI476732 



AA470158 
BE222648 
AW206942 
AW105231 
AW794600 
NM.005756 
BE622641 
AI742618 

NMJ314581 
AP000692 
AB029496 
BE005346 
AK001666 
AA018534 
NA 

AV660737 
AW816379 
U80736 
AB020689 
AA312082 
N62840 
NA 

AA324597 
U79734 
AI021992 
AA629065 
R55373 
BE623004 
AI347502 
T32982 

AL1 09791 
BE011668 
AI239923 
AI970797 
AI248584 
NA 

D31771 
N74530 
AV658444 
AI377755 
M97815 
AI698839 
AL120173 
AW901456 
AA352111 
AI142095 
BE164500 
AA157291 
AA062954 
Y12735 
AK000054 
NA 

AA125985 

AA380177 
NA 

AA766296 
AB007961 



Hs.9029 
Hs.265459 
Hs.117313 
Hs.153205 
Hs.263912 

Hs.98202 
Hs.279458 
Hs.253594 
Hs.192035 

Hs.184942 
Hs.38489 
Hs.181733 

Hsi74480 
Hs.1 29781 
Hs.59729 
Hs.116410 
Hs.189095 
Hs.103334 

Hs.135100 
Hs.335018 
Hs.1 10826 
Hs.90419 
Hs.105445 
Hs.48648 

Hs.21851 
Hs.97206 
H&124244 
Hs.1 16301 
Hs.20864 

Hs.107872 
Hs.102720 

Hs.241559 

Hs.30098 
Hs.&4859 
Hs.1 90745 

Hs.89404 
Hs.21168 
Hs.280776 
Hs.1 20695 
Hs.1 83650 

Hs.301663 



DKFZP434G032 protein 
ESTs, Moderately similar to ALU2_HUMAN A 
Meis (mouse) homotog 3 
ESTs 
ESTs 

Target Exon 
ESTs 

ESTs, Highly similar to c380A1.1b [H.sap 
ESTs 
ESTs 

qb:RC64JM0014-17Q300-022-C05 UM0014 Homo3.8 
y 3.8 



3.6 
3.8 
3.B 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 



Hs.143273 



Hs.21479 
Hs.141883 
Hs.38018 
Hs.12347 

Hs.56145 

Hs.125845 



Hs.99200 
Hs.127338 



G protein-coupled receptor 64 
ESTs, Weakly similar to 138022 hypotheti 
ESTs', Weakly similar to nitrf lase homolo 
Homo sapiens keratin 17 (KRT17) 
odorant-binding protein 2A 
chromosome 21 open reading frame 5 
semaphorln sem2 
ESTs 

similar to SALL1 (sal (Drosophtla}-like 
ESTs 

C3002523:gi|6686211|sp|Q27533jYH2W_CAEEL3.7 

ESTs ,7 
ESTs 

trinucleotide repeat containing 9 
KIAA0882 protein 
GDNF family receptor alpha 1 
ESTs 

NM_024817:Homo sapiens hypothetical prot 
Homo sapiens cDNA FLJ 12900 fe, clone NT 
huntingtin interacting protein 1 
ESTs 
ESTs 
ESTs 

gb^01441282F1 NIH_MGC_72 Homo sapiens c 3.7 
hypothetical protein FLJ20761 37 
ESTs 37 
NrA_015832:Homo sapiens methyl-CpG bindln 37 
Homo sapiens mRNA full length Insert cDN 3.7 
gb;CNl3-BN0223-100500-177-a04 BN0223 Homo3.7 
ESTs 37 
ESTs 37 
Homo sapiens cDNA: FU21 326 fis, clone C 3.7 
Target Exon 

msh (Drosophila) homeo box homolog 2 
ESTs 

tankyrase, TRF1 -interacting ankyrin-rela 
ESTs 

cellular retinoic acid-binding protein 2 
gbwd31f02.x1 Soares^NFL_TJ3BC_S1 Homos3.6 

ESTs 3 - 6 
gb:RC0-NN1012-270300-031-cO7 NN1012 Homo3.6 
gb;EST50061 Activated T-cells XX Homo sa 3.6 

ESTs 36 
gb:RC4-HTO469-23O3OO-O14-e10 HT0469 Homo 3.6 



3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
37 
3.7 



3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
37 
3.7 
37 



3.6 
3.6 
3.6 
3.6 
3.6 
3.6 



ubinuctein 1 
ESTs 

duaj-specificrty tyrosine-(Y)-phosphoryt 
hypothetical protein FLJ 20047 
Target Exon 

thymosin, beta, identified in neuroblast 
Target Exon 

ribu lose-5-phosphate-3-epimerase 
ENSP00000233023*:CDNAFUl2662fis,clon ... 
C7001385:gi|12082809|gb|AAG48618.1[AF315 3.6 

ESTs \l 
KIAA0492 protein 3.6 
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424202 


BE350295 


Hs. 15032 


431750 


AA514986 


Hs.283705 


439907 


AA853978 


Hs. 124577 


453596 


AA441838 


Hs.62905 


406446 


NA 




418454 


AA315308 


Hs.195870 


434360 


AW015415 


Hs. 127780 


409079 


W87707 


Hs.82065 


440132 


AI697121 


Hs.202466 


448706 


AW291095 


Hs.21814 


440671 


AW297920 


Hs. 130054 


407647 


AW860158 




459023 


AW968226 


Hs.60798 


402820 






417009 


AA191719 


Hs.314714 


447048 


AW393O80 


Hs.228320 


449978 


AI806335 


Hs.200829 


428062 


AA420683 


Hs.98321 


452909 


NM_015368 


Hs.30985 


400610 


NA 




417843 


W07361 


Hs.22545 


419335 


AW960146 


Hs.284137 


451592 


AI805416 


Hs.213897 


443270 


NWL004272 


Hs.337737 


423948 


AW392342 


Hs.283077 


449424 


AW448937 


Hs. 197030 


423B41 


AW7 53967 




416806 


NM_000288 


Hs.79993 


422060 


R20893 


Hs. 325823 


433104 


AL043002 


Hs.128246 


415778 


H84847 


Hs.49391 


413054 


AW3 16843 


Hs.66309 


416636 


N32536 


Hs.42645 


424639 


AI9 17494 


Hs.9812 


424827 


AI057094 


Hs.96867 


437782 


A [370876 


Hs.79090 


411514 


AW850178 




413783 


AA314337 


Hs.301547 


421106 


AA877124 


Hs.1 72844 


431291 


N25521 


Hs.25275 


440623 


AI935016 


Hs.216639 


455838 


BE1 45808 




458771 


AW295151 


Hs.163612 


442942 


AW1 67087 


Hs.131562 


436550 


Z50158 


Hs.270235 


418849 


AW474547 


Hs, 53565 


424420 


BE614743 


Hs. 146688 


430916 


nil sJ\h)\J£. 1 


Hs.88414 


4?? (130 


AI90R400 


Hs. 143789 


439405 


AF086224 


Hs.55238 


405917 


NA 




452727 


AW993582 


Hs.176220 


426320 


W47595 


Hs.169300 


421070 


AA283185 


Hs.19327 


424625 


AW904466 


Hs.321197 


428508 


BE252383 


Hs.184668 


455651 


BE064962 




410555 


U92649 


Hs. 64311 


447754 


AW073310 


Hs.163533 


418636 


AW749855 




404097 


NA 




434205 


AF1 19861 


Hs.283032 


421072 


AI215069 


Hs.89113 


402421 


NA 




405248 


NA 




407636 


AJ404672 


Hs.334483 


403000 


BE247275 


Hs.151787 



RAN binding protein 17 3.6 
ESTs 3,6 
ESTs 3.6 
hypothetical protein FU14834 3.6 
Target Exon 3.6 
hypothetical protein FU 14991 3.6 
ESTs 3.6 
interleukin 6 signal transducer (gp130, 3.6 
ESTs, Weakly similar to S65824 reverse t 3.6 
interieukin 20 receptor, alpha 3.6 
ESTs 3.5 
gb:RC0-CTO379-29O10O-032-bO4 CT0379 Homo 3.5 

3,5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3,5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 



ESTs 

NM„017646*:Homo saptens tRNA isopentertyl 
ESTs 

hypothetical protein FU23537 
ESTs, Weakly similar to T30171 nfnein - 
hypothetical protein FLJ14103 
pannexin 1 
Target Exon 

Homo sapiens cDNA FU12935 fis, clone NT 
hypothetical protein FLJ12888 
ESTs 

Homer, neuronal immediate early gene, 1B 
centrosomal P4.1 -associated protein; unc 
ESTs 

gb;RC2-CT0304-080 1 00-fl 1 1 -hi 2 CT0304 Homo 3.5 
peroxisomal biogenesis factor 7 3.5 
ESTs, Moderately similar to ALU5JHUMAN A 3.5 
ESTs, Moderately similar to unnamed prot 3.5 
hypothetical protein LOC54149 3.5 
hypothetical protein MGC1 1 061 3.5 
solute carrier family 16 (monocarboxylic 3.5 
Homo sapiens cDNA FU14388 fis, clone HE 3.5 
Homo sapiens cDNA: FU231 55 fis, clone L 3.5 
exportin 1 (CRM1, yeast, homolog) 3.5 
g b: I L3-CT02 19-271 099-022 -H 1 2 CT0219 Homo 3.5 
ribosomal protein S7 3.5 
ESTs 3.5 
Kruppel-type zinc finger protein 3.5 
ESTs 3.5 
gb:MR0-HT0208-1 01 299-1 03-f 1 1 HT0208 Homo 3.5 

3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.4 
3.4 
3.4 
3.4 



ESTs 
ESTs 

ESTs, Weakly similar to MMHUB1 laminin b 
Homo sapiens PfG-M mRNA for martnosyltran 
prostaglandin E synthase 
BTB and CNC homology 1, basic leucine zi 
ESTs 
ESTs 

C17000675;gip290703|gblAAF46150.1f (AE0 
ESTs 

transforming growth factor, beta 2 
ESTs 

PDZ domain protein {Drosophila inaD-like 
SBBI31 protein 

gb:RC 1 -BT031 3-1 3040001 6-c02 BT0313 Homo 3.4 

a disintegrin and metal loproteinase doma 3.4 

Homo sapiens cDNA FU14142 fis, clone MA 3.4 

gb:QV4-BT0534-28129£O53-c05 BT0534 Homo 3.4 

C5000242*:gl|9369379|gb(AAF67128.1|ACC06 3.4 

hypothetical protein PRO201 5 3.4 

ESTs 3.4 

C1001578*:gi|6759903jgb|AAF28099.1| (AF1 3.4 

Target Exon 3.4 

hypothetical protein FU2357 1 3.4 

U5 snRNP-specific protein, 116 kD 3.4 
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433393 AF038564 Hs.98074 ifchy (mouse homolog) E3 ubiqu'rtin prote 3,4 

432239 X81334 Hs.2936 matrix metalbprotetaase 1 3 (colbgenase 3.4 

458747 BE618395 Hs.257391 hypothetical protein DKFZp761J1 523 3,4 

442082 R41823 Hs.7413 ESTs; calsyntenin-2 3.4 
5 417974 AA210765 gb.zr90c06.r1 NCLCGAP_GCB1 Homo sapiens 3.4 

446002 AJ346468 Hs. 145789 ESTs 3.4 

448995 AI613276 Hs.5662 guanine nucleotide binding protein {G pr 3.4 

436007 AI247716 Hs.232168 ESTs 3.4 

424698 AA164366 Hs.151973 hypothetical protetn FLJ2351 1 3.4 

10 435202 AI971313 Hs.1 70204 K I AA0551 protein 3.4 

410467 AF102546 Hs.63931 dachshund (Drosophila) homolog 3.3 

405460 NA Target Exon 3.3 

441826 AW503603 Hs.129915 phosphotrieste rase related 3.3 

453472 AL037925 gb:DKFZp564M037_r1 564 (synonym: hfbr2) 3.3 

15 447078 AW885727 Hs.301570 ESTs 3.3 

441690 R81733 Hs.33106 ESTs 3.3 

420092 AA814043 Hs.88045 ESTs 3.3 

418478 U38945 Hs.1 174 cyclin-dependent kinase inhbitor 2A (me 3.3 

408908 BE296227 Hs.250822 serine/threonine kinase 15 3.3 

20 414737 AI160386 Hs.125087 ESTs 3.3 

449650 AF055575 Hs.23838 calcium channel, voltage-dependent, L ty 3.3 

418912 NM_0CO685 Hs.89472 angiotensin receptor 1 3.3 

436405 AA160079 Hs.172932 Homo sapiens mRNA for partial 3'UTR, seq 3.3 

453911 AW503857 Hs.4007 SarcolemmaJ- associated protein 3,3 

25 409361 NM_005982 Hs.54416 sine oculis home obox (Drosophila) homofo 3.3 

429548 AW 138872 Hs.135288 ESTs 3.3 

420807 AA280627 Hs.57846 ESTs 3.3 

409695 AA296961 gb:EST1 1251 4 Adrenal gland tumor Homo sa 3.3 

445189 AI936450 Hs.147482 ESTs 3.3 

3 0 402892 NA Target Exon 3.3 

426681 AA994896 Hs.22514 ESTs 3.3 

456722 AA741545 Hs.282832 ESTs, Weakly similar to T2 4961 hypotheti 3.3 

409430 R21 945 Hs.1 66975 splicing factor, arginlne/serine-rich 5 3.3 

443194 AI954968 Hs. 279009 matrix G la protein 3 3 
3 5 445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA done 3 3 

410908 AA121686 Hs.10592 ESTs 3 3 

406151 NA Target Exon 3 3 

436461 AW511956 Hs.293261 ESTs 3.3 
411171 AW820260 gb;GV2-ST0296-150200-040-c10 ST0296 Homo 33 

40 432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3 3 

439310 AF086120 Hs.102793 ESTs 3 3 

401575 NA Target Exon 3.3 

420900 AL045633 Hs,44269 ESTs 3 3 

445628 A1344166 Hs 155743 ESTs 33 

45 448243 AW369771 Hs 52620 integrin, beta 8 3.3 

445102 AW204610 Hs.22270 ESTs 3 3 

442118 M976718 Hs.202242 ESTs 3.3 

416783 AA206186 Hs.79889 monocyte to macrophage differentiation-a 33 

435039 AW043921 Hs.1 30526 ESTs 3 3 

5 0 451474 T70874 Hs.207636 ESTs 32 

442559 T10213 Hs, 159993 gycosyrtransferase 3 2 

453921 AI824009 Hs.44577 ESTs 3 2 

420036 R60336 Hs.52792 Homo sapiens mRNA; cDN A DKFZp586l 1823 (f 3.2 

435627 W88774 Hs, 11 8370 ESTs 3.2 

55 411598 BE336654 Hs 70937 H3 histone family, member A 3.2 

446733 AA863360 Hs 26040 ESTs, Weakly similar to fatty acid omega 32 
410153 BE311926 Hs 15830 hypothetical protein FU1 2691 3.2 
403637 NA C 300 1 1 06 *:g i| 1 004 72 0 1 jd bj|B AB 1 3394 . 1 1 (A 32 
405547 NM„018833*:Homo sapiens transporter 2. A 3.2 
60 427878 C05766 Hs 181022 CGI-07 protein 3 2 

451871 AI821005 Hs.1 18599 ESTs 3.2 
410313 R10305 Hs.185683 ESTs 3.2 
416856 N27833 Hs.269028 ESTs, Weakly similar to 1 38 022 hypotheti 3.2 
449490 AI652777 Hs.197069 ESTs 3.2 
65 450506 NMJ04460 Hs.418 fibroblast activation protein, alpha 3.2 
440684 AI253123 Hs.127356 ESTs, Highly similar to S21424 nestin [H 3.2 
459055 N23235 Hs.30567 ESTs, Weakly similar to B3 4087 hypotheti 3.2 
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452190 H26735 Hs.91668 Homo sapiens cione PP1498 unknown mRNA 3.2 

430965 AA489732 Hs.154918 ESTs 3.2 

405394 Target Exon 3.2 

424693 BE169810 Hs.47557 ESTs 3,2 

5 454265 H03556 Hs.300949 ESTs, Weakly similar to thyroid hormone 3.2 

437687 AA765917 Hs.122840 ESTs 3.2 

428372 AK000684 Hs.183887 hypothetical protein FL J 22 104 3.2 

414083 AL121282 Hs.257786 ESTs 3.2 
411670 AW856552 gb:RC1-CT0294-060100-012-a04 CT0294 Homo 3.2 

10 416283 NMJW5429 Hs.79141 vascular endothelial growth factor C 3.2 

437488 AA758239 Hs.180330 ESTs 3.2 

428398 AI249368 Hs.96558 ESTs 3.2 

452042 H38857 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 3.2 

421477 AI904743 Hs.104650 hypothetical protein FLJ 10292 3.2 

15 438078 AI016377 Hs.131693 ESTs 3.2 

448816 AB033052 Hs.22151 KIM 12 26 protein 3.2 

419519 AI198719 Hs.176376 ESTs 3.2 

404580 NW_0141 12*;Homo sapiens tn'chorhinophala 3.2 

447046 AA326187 Hs.17170 G protein-coupled receptor 4 3.2 

20 457473 AW974903 Hs.291231 ESTs 3.1 

429838 AW904907 Hs.30732 hypothetical protein FLJ13409; K1AA1711 3.1 

459702 AI204995 gb;an03c03.x1 Stratagene schizo brain S1 3.1 

400195 NA NM_007057*:Homo sapiens ZW10 interactor 3.1 

417860 AW408557 Hs.235498 hypothetical protein FLJ 14075 3.1 

25 417995 AW974175 Hs.1B8751 ESTs, Weakly similar to MAPBJHUMAN MICRO 3.1 

422589 AA312735 Hs.30512 Homo sapiens mRNA for KIAA0556 protein, 3.1 

435870 AA701327 Hs.17949 ESTs 3.1 

440801 AA906366 Hs.190535 ESTs 3.1 

426274 D38122 Hs.2007 tumor necrosis factor (ligand) superfami 3.1 

3 0 423728 AW891294 Hs.132136 solute carrier family 4, sodium bicarbon 3.1 

439677 R82331 Hs.164599 ESTs 3.1 

452834 AI638627 Hs,105685 K1AA1 688 protein 3.1 

431349 AA503653 Hs.156942 ESTs, Moderately similar to AUI2_HUMAN A 3.1 

417576 AA339449 Hs.82285 phosphoribosylglydnamide formyltransfer 3.1 

3 5 430264 AA470519 gb:nc71f10.s1 NCLC6AP_Pr1 Homo sapiens 3.1 

418827 BE327311 Hs.47166 HT021 3.1 
410835 AW806906 gb:QV4-ST0023-1 60400-1 72-d1 2 ST0023 Homo 3.1 

426269 H15302 Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.1 

405336 NA Target Exon 3.1 

40 437783 AI683150 Hs.201550 ESTs, Weakly similar to ALU 1J-IU MAN ALU S 3.1 

440931 AI583052 Hs.270058 ESTs 3.1 
455945 BE160636 gb:PM1-HT0422-291299-002-c08HT0422 Homo 3.1 

430437 AI768801 Hs.169943 Homo sapiens cDNA FU1 3569 fts, clone PL 3.1 

405S48 NA Target Exon 3.1 

45 455685 BE066976 gb:PMO-BT0340-211299-003-c12 BT0340 Homo 3.1 

406970 M29994 gb:Human alpha ! spectrin gene, exon 12. 3.1 

409602 W26713 Hs.256972 ESTs 3.1 

423518 D45027 Hs.129732 R3H domain (binds single-stranded nuclei 3,1 

425653 AI065104 Hs.249718 ESTs, Weakly similar to A46010 X-frnked 3.1 

5 0 426326 BE165753 Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, 3.1 

433805 AA706910 Hs.112742 ESTs 3.1 
437152 AL050027 gb;Homo sapiens mRNA; cDNA DKFZp566C03243.1 

448602 AI541305 Hs.48778 niban protein 3.1 

452844 AW407181 Hs.218377 Homo sapiens cDNA FU 11 927 fts, clone HE 3.1 

55 407366 AF026942 gb;Homo sapiens dg33 mRNA, partial sequ 3.1 

408254 AW807227 g b:MR4-$TQ062-1 80200-00 1 -e 1 0 ST0062 Homo 3.1 

424085 NMJM2914 Hs.139226 replication factor C (activator 1)2 (40 3.1 

416790 R83066 Hs.7043 succinate-CoA ligase, GDP -forming, alpha 3.1 

420020 BE295866 Hs.94382 adenosine kinase 3.1 

60 426119 W94997 Hs.189917 ESTs 3.1 

426968 U07616 Hs.173034 amphiphysin (Stiff-Mann syndrome with br 3.1 

457421 AL117431 Hs.112165 Homo sapiens cDNA FLJ 12 198 As, clone MA 3.1 

453403 BE466639 Hs.61779 Homo sapiens cDNA FLJ 1359 1 fis, clone PL 3.1 

454141 AW138413 Hs.139336 ATP-binding cassette, sub-family C (CFTR 3.1 

65 426650 AA382814 g b:EST9 6097 Testis I Homo sapiens cDNA 5 3.1 

450865 AI248013 Hs.106532 ESTs, Weakly similar to 138588 reverse t 3.1 

407993 AW135274 Hs.12433 ESTs 3.1 
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446465 
457888 
420058 
409248 
452747 
418926 
419346 
429826 
435147 
420139 
405609 
404274 
449777 
415459 
415245 
406291 
414210 
432055 
442246 
451353 
451177 
418026 
401326 
409920 
432887 
411789 
401045 
432162 
434627 
442611 
425477 
457183 
433014 
415542 
416173 
408155 
453913 
435495 
423629 
411836 
415030 
419606 
440310 
443608 
414140 
444781 
435393 
454071 
446922 
448062 
427711 
450382 
424866 
433043 
420802 
445625 
403677 
411093 
435255 
443127 
448104 
427315 
430414 
423600 
458562 
402109 
429629 



H38025 
BE219794 
AK001423 
AB033035 
BE153855 
AA232658 
AI830417 
N93266 
AL133731 
NMJ505357 
NA 

AI971362 
H07118 
N59650 
NA 

BE383592 
AW972359 
AI791988 
N21043 
AJ969716 
BE379727 
NA 

BE1 69746 
AI926047 
AF245505 

AA584062 
AI221894 
BE077155 
AW958879 
H91882 
NM_014711 
R 13474 
R52782 
AB014528 
AW004683 
AI754212 
AW021173 
AW901879 
D31118 
AW294795 
AA878939 
AI375957 
AA281279 
NM_014400 
AA701259 
AI041793 
BE175605 
AW295923 
M31659 
AA397658 
W01938 
W57554 
U22376 
BE246743 
NA 

BE067650 
W87434 
BE568102 
AI674618 
AA179949 
AW365665 
AI633559 
N34128 
NA 

BE501732 



arrestin 3, retinal (X^arrestin) 
ESTs 

Homo sapiens cDNA FU10561 (is, clone NT 
KIAA1209 protein 
!g superfamily receptor LNJR 
UDPij!ucose:gtycoorotein glucosyltransfe 
pofybromol 
ESTs 



Hs.308 
Hs.293471 
Hs.94694 
Hs.51965 
Hs.61460 
Hs.105794 
Hs.44143 
Hs.40747 

Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.0 
Hs.95351 



3.1 
3.1 

3.0 
3.0 
3.0 
3.0 
3.0 
3.0 



Hs.231945 

Hs.6099 

Hs.27252 



Hs.293334 

Hs.129115 

Hs.42932 

Hs.13034 

Hs.83213 

Hs.12504 

Hs.162859 

Hs.72157 

Hs.272798 

Hs.39311 

Hs.177537 

Hs.270535 

Hs.1 18569 

Hs.279912 

Hs.290263 



3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 



lipase, hormone-sensitive 
ENSP00000241065XDNA 
NM_002944*:Homo sapiens v-ros avian UR2 
ESTs 
ESTs 
ESTs 

Taiget Exon 

gb£01297871F1 NIHMGCJ9 Homo sapiens c 3.0 
ESTs 3.0 
ESTs 3.0 
ESTs 3.0 
ESTs 3.0 
fatty acid binding protein 4, adipocyte 3.0 
C10000447*:gi[1168375|sp|P43467|AGA1_PED 3.0 
likely ortbolog of mouse Arkadia 
ESTs 

DKFZP564I1922 protein 
C11001883*.-gi|6753278jref|NP_033938.1|c 
hypothetical protein FU20413 
ESTs 

hypothetical protein DKFZp761B1514 
ESTs 

Dvl-binding protein IDAX (inhibition of 
KIAA0419 gene product 
ESTs, Weakly similar to I38022 hypotheti 
gb:yg99d09.M Soares infant brain 1NIB H 
Hs.43 1 33 KIAA0628 gene product 
Hs.78934 mutS (E. coli) homolog 2 (colon cancer, 
Hs.21951 Homo sapiens Xq pseudoautosomal region; 
Hs.18612 Homo sapiens cDNA; FLJ21909 fis, clone H 
Hs.314453 ESTs 

Hs.191735 hypothetical protein MGC10520 
Hs.198529 ESTs 
Hs. 125406 ESTs 
Hs.289074 F-box only protein 22 
Hs.23317 hypothetical protein FLJ14681 
Hs . 1 1950 G Pl-anchored metastasis-associated prote 
Hs.189299 ESTs 
Hs.42502 ESTs 



3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 
3.0 



Hs.255472 

Hs.1 80408 

Hs.60257 

Hs.337243 

Hs.125019 

Hs.1334 

Hs .288529 



Hs.106015 
Hs.180312 
Hs.316433 
Hs.175563 
Hs.120388 
Hs.310359 
Hs.145268 



gb:RC5-HT0580-10050(H]22-H07 HT0580 Homo3.0 



KIAA1 843 protein 

solute carrier family 25 (mitochondrial 
Homo sapiens cDNA RJ13598 fis, clone PL 
ESTs, Weakty simaarto ALU7JHUMAN ALU S 
lymphoid nuclear protein (LAF-4) mRNA 
v-myb avian myeloblastosis viral oncogen 
hypothetical protein FU22635 
C4001462:gi|4887715fcb|AAA79329.2| (L088 
gb-MR4-BT0358^9030O003^e01 BT0358 Homo 2.9 

ESTs, Moderately similar to ALU1_HUMAN A 2.9 

mitochondrial ribosomal protein S16 2.9 

Homo sapiens cONA FU11375 fis, done HE 2.9 
Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2.9 



3.0 
3.0 
3.0 
2.9 
2.9 
2.9 
2.9 
2.9 



ESTs 
ESTs 
ESTs 

Target Exon 

Hs.30622 Homo sapiens cDNA FU 1301 0 fis, done NT 
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442295 
419752 
404721 
445107 
401987 
430566 
444517 
445563 
427691 
4 56561 
401458 
421039 
459504 
424962 
409617 
416931 
413221 
409732 
433687 
434340 
454529 
421379 
452291 
457402 
449051 
408761 
401093 
435061 
447985 
449340 
426384 
411905 
405953 
420854 
434265 
428365 
442861 
448337 
452554 
412248 
449450 
434757 
409038 
454545 
439842 
428479 
424800 
411086 
400250 
449168 
456482 
426044 
431854 
405873 
440400 
458265 
413708 
423739 
424408 
453096 
421825 
417742 
402765 
444378 
419172 
401497 
402376 



AI827248 Hs.224398 Homo sapiens cDNA FLJ1 1469 fis, done HE 2.9 
AA249573 Hs.152618 ESTs, Moderately similar to ZN91JWMAN Z 2.9 
NM_005596*:Homo sapiens nuclear factor I 2.9 
A1208121 Hs.147313 ESTs, Weakly similar to I38022 hypotheti 2.9 
NM_002737*:Homo sapiens protein kinase C 2.9 
ESTs 2.9 
ESTs 2.9 
ESTs 2.9 
ESTs 2 9 

ESTs, Weakly similar to T32250 hypotheti 2.9 
Target Exon 29 
cullin 5 2 9 

gb 601315974F1 NIH J/IGCJ Homo sapiens cD 2 9 
TRAM-like protein 2 9 

Homo sapiens mRNA; cDNA DKFZp434K05l4 (f 2.9 
adipose most abundant gene transcript 1 2 9 
gb PM0-HT0425-141299-O01-F08 HT0425 Homo2 9 
NY-REN-58 antigen 2 9 

gb:ny57g01.s1 NCLCGAP_Pr18 Homo sapiens 2 9 



AA481282 
AI939339 
AW873606 
AW1 94426 
AI868634 

NM_003478 
BE514127 
NM 012288 
BE003760 
D45371 
BE161151 
NMJ16122 
AA743991 
AI193043 
Z45439 
Y15221 
AF015592 
AW4 52648 
AW961400 
AA057264 

AI651474 

AI681475 

AW235786 

AI472078 

BE265067 

NA 

AW296927 

AA846811 

AA295331 

AA243837 

AW206453 

AW452434 

BE176480 

AL039852 

AI038997 

T97490 

AW806899 

AI910896 

Y00272 

AL035588 

BE070800 

NA 

NMJ16206 

AA485224 

AA502490 

AA383550 

NA 

AA994364 

A1075375 

BE1 58791 

AA398155 

AI754813 

AW294631 

AA298758 

R64719 

R41339 
AW338625 



Hs.190149 
Hs.146883 
Hs. 149006 
Hs.20726 
Hs.246358 

Hs.101299 

Hs. 153954 
Hs.55209 
Hs.80485 

Hs.56148 

Hs.1 28685 

Hs.270425 

Hs.103982 

Hs.28853 

Hs.149342 

Hs.333526 

Hs.238936 

Hs.163944 
Hs.200949 
Hs.195359 
Hs.303662 



ESTs, Weakly similar to T17226 hypotheti 
ESTs 

small inducible cytokine subfamily B (Cy 
CDC7 (cell division cycle 7, S. cerev'isi 
activation-induced cytidine deaminase 
HER2 receptor tyrosine kinase (c-erb-b2, 
ESTs, Weakly similar to (defline not ava 
C12000586*:gi|6330167|dbj|BAA86477.1|(A 
ESTs 
ESTs 

hypothetical protein MGC10954 
ESTs 



Hs.1 30554 

Hs.183861 

Hs.57787 

Hs.3782 

Hs.58006 

Hs.49136 
Hs.132921 
Hs.50002 

Hs.132413 
Hs.184572 
Hs.153203 



Hs.23142 

Hs.336695 
Hs.271699 

Hs.125594 
Hs.128193 

Hs.97600 
Hs.146428 
Hs.1 1325 
Hs. 183747 



Hs.12569 
Hs.22120 



2.9 
29 
29 
29 
29 
29 
29 
2.9 
29 
2.9 
29 
2.9 

gb:601 193893F1 N|H_MGC_7 Homo sapiens cD 2.9 
Target Exon 2 8 

gb:lll-H-BW0-ajc-c-07-0-ULs1 NCI_CGAP_Su 2 8 
Homo sapiens cDNA: FLJ23089 fis, done L 2 8 
Homo sapiens cDNA FLJ20042 fis, clone CO 2 8 
ESTs 2.8 
ESTs 2 8 

ESTs, Weakly similar to ALU5_HUMAN ALU S 2 8 
gb:RC3-HT0585-160300-022-c02 HT0585 Homo 2 8 
ESTs , Moderately similar to ALU7 JHUMAN A 2 .8 
ESTs 2 8 

small inducible cytokine subfamily A (Cy 2 8 
gb;QV4-ST0023-160400-172-c12ST0023 Homo 2 8 
ESTs 2 8 

ceU division cycle 2, G1 to S and G2 to 2 8 

MyoD family inhibitor 2 8 

gb:RC3-BT0502-251 199-011 -c07 BT0502 Homo 2 8 
Eos Control 2 8 

colon carcinoma related protein 2.8 
gb:aa41b12.s1 NCI_CGAP_GCB1 Homo sapiens2.8 



ESTs 2.8 

polymerase (DNA directed) iota 2 8 

Target Exon 2 8 

ESTs, Weakly similar to T25472 hypotheti 2 8 

ESTs, Weakly similar to IRX2_HUMAN IROQU 2 8 

gb;IL2-HT0397-O91299-O25-D02 HT0397 Homo 2.8 

ESTs 28 

collagen, type V, alpha 1 2 8 

ESTs 2 8 

ESTs, Moderately similar to CALB JHUMAN C 2.8 
gb:EST22d11 WATM1 Homo sapiens cDNA cton 2.8 

C1 00362 1 \gi| 1 2407405|gbJAAG53491 . 1 JAF22 2.8 

ESTs 2 8 

ESTs 2.8 

Target Exon 2 8 

C19000763*:gi|1 36391 2|pir||JC4296 ring f 2 8 
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405041 
408758 
431917 
437583 
453737 
458094 
401283 
410784 
417601 
418236 
435532 
454714 
418629 
442101 
405080 
414661 
425589 
429638 
428824 
421566 
414596 
440868 
452943 
443772 
432361 
430375 
406504 
423279 
424871 
453619 
423961 
422156 
415752 
419987 
406182 
416495 
444701 
408171 
430153 
413383 
414831 
413278 
433132 
437030 
439031 
449532 
406153 
406625 
444698 
432328 
429628 
420149 
431207 
438394 
443304 
427660 
408460 
416515 
429922 
418203 
439509 
402184 
450496 
451963 
457938 
441541 
441111 



NA 

NM 003686 
D16181 
M761190 
AA744862 
AF086325 
NA 

AW803201 
NM_014735 
AW994005 
AW291488 
AW815098 
BE247550 
AI651930 
AK000375 
T97401 
AI650633 
AI916662 
W23624 
NM_000399 
BE386S70 
R79707 
BE247449 
AV646449 
AI378562 
AW371048 
NA 

AW959861 
NMJX)4525 
H87648 
D 13666 
N34524 
BE314524 
NM_005014 
NA 

X69970 
AI916512 
AA301228 
AW968128 
AA128978 
M31158 
BE563085 
AB026264 
AA742577 
AF075079 
W74653 

Y13647 
A1188139 
AI572739 
H09604 
AA255920 
AA495925 
BE379623 
AI050073 
AI741320 
AA054726 
N91716 
Z97630 
X54942 
AF086332 
NA 

AW449251 
AI825440 
AI373638 
AA938663 
AI806867 



Hs.47504 
Hs.2868 
Hs.244627 
Hs. 194293 



C3001706*:git1345652|sp|P15989|CA36_CHIC 2.8 



Hs.82292 
Hs.337534 
Hs.1 17305 

Hs.86859 
Hs.135684 
Hs.88820 
Hs.21929 
Hs. 143688 
Hs.211577 
Hs. 173059 
Hs.1395 

Hs.263339 
Hs.31082 
Hs.282872 
Hs.159585 
Hs.93758 

Hs.290943 
Hs.153595 
Hs.33922 
Hs. 136348 

Hs.78776 
Hs.94070 

Hs.79350 
Hs.198394 
Hs.43299 
Hs.336679 
Hs.154706 
Hs.77439 
Hs.833 
Hs.284245 
Hs.303781 

Hs.271593 

Hs.1 19597 
Hs.147050 
Hs.195471 
Hs.13268 
Hs.88095 
Hs.9394 
Hs.27693 
Hs.135338 
Hs.114121 
Hs.285574 
Hs.194140 
Ks.226117 
Hs.83758 
Hs.58314 



exonudease 1 2,8 
peripheral myelin protein 2 2,8 
ESTs 2.8 
ESTs, Weakly similar to I54374 gene NF2 2 8 
gb:Homo sapiens full length insert cDNA 2 8 
Target Exon 28 
gb:tL2-UM0077-070500-080-E06 UM0077 Homo 28 
KIAA0215 gene product 2 8 

ESTs 2 8 

Homo sapiens, done IMAGE:3682908, mRNA 2 8 
gb:QV4-ST021 2-091 1 99-023-f 10 ST0212 Homo 2 8 



growth factor receptor-bound protein 7 2 8 

ESTs 2 8 

HDCMC28P protein 2.8 

ESTs 28 

Homo sapiens cDNA: FU23031 fis, clone L 2.8 

kinectin 1 (kinesin receptor) 2.7 

ESTs 2.7 

early growth response 2 (Krox-20 (Drosop 2.7 

gb:601 275271F1 NIHJ/IGCJO Homo sapiens c 2.7 



ESTs, Moderately similar to 138022 hypot 27 

hypothetical protein FLJ10525 2.7 

ESTs 27 

ESTs 27 

H4 histone family, member H 27 

C5000558:gi|4504675|reflNP_002175.1i int 27 

ESTs 27 

tow density lipoprotein-related protein 27 
Homo sapiens, cbne MGC:9084, mRNA, comp 2.7 

osteoblast specific factor 2 (fasdciin 27 

gb:yy56d10.s1 Soares_multipte_sclerosis_ 27 

putative transmembrane protein 2.7 

osteomodulin 2.7 

Target Exon 27 

RYK receptor-like tyrosine kinase 27 

ESTs 27 

hypothetical protein FLJ12890 27 

ESTs 27 

hypothetical protein FU 1491 7 27 

protein kinase, cAMP-dependent, regulato 27 

interferon-stimulated protein, 15 kDa 27 

hypothetical protein IMPACT 2 7 

EST 27 

gb:Homo sapiens full length insert cDNA 27 

ESTs, Moderately similar to A47582 B-cel 27 

Target Exon 27 

stearoyl-CoA desaturase (delta-9-desatur 2.7 

ESTs 27 

6-phosriiofrix^o-2-kinase/fnjctose-2,6-bi 2.7 

ESTs 27 

ESTs 27 

ESTs 27 

peptidylprolyl isomerase (cyclophirtn)-l 27 

ESTs 2.7 

Homo sapiens cDNA: FLJ23228 fis, clone C 27 

ESTs 27 

ESTs, WeaWy similar to I38022 hypotheti 27 

H 1 histone family, member 0 2 7 

CDC28 protein kinase 2 27 

ESTs 27 
ENSP00000245238*:CDNA FLJ10922 fis, cton 27 

Hs.257131 ESTs 27 

Hs.224952 ESTs 2.7 

Hs.1 33900 ESTs 27 

Hs.199828 ESTs 27 

Hs,1 26594 ESTs 27 
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423020 AA383092 Hs,1608 replication protein A3 (14kD) 2.7 

445354 AV653485 Hs.6390 Homo sapiens done FLB3344 PRO0845 mRNA, 2 7 

427961 AW293165 Hs,143134 ESTs 2 7 

410889 X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 2 7 

5 445234 AW137636 Hs.146059 ESTs 2 7 

413903 AA496493 Hs.23136 ESTs 2 7 

406069 NA TaigetExon 2.7 

447410 AI470235 Hs.172698 EST 2 7 

401256 NA NM_024089*:Homo sapiens hypothetical pro 2 7 

10 415139 AW975942 Hs.48524 ESTs 2.7 

420218 AW958037 Hs.286 ribosomaJ protein L4 2 7 

455511 BE144762 gb:CM0-HT018(K)41099-O65-b04 HT0180Homo2.7 

438825 BE327427 Hs.79953 ESTs 26 

409564 AA045857 Hs.54943 fracture callus 1 (rat) homolog 2 6 

15 452837 AL121053 Hs.5534 Homo sapiens cDNA FU12961 fs, clone NT 2.6 

434876 AF160477 Hs.61460 Ig superfa mi ly receptor LN I R 2.6 

421565 AK001122 Hs. 105859 hypothetical protein FU 10260 2 6 

453279 AW893940 Hs.59698 ESTs 2.6 

430785 Z30201 gb:HHEA22G Atrium cDNA library Human hea 2 6 

20 456986 D38299 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 2 6 

433068 NMJ>06456 Hs.288215 sialyltransferase 26 

421952 AA300900 Hs.98849 ESTs, Moderatefy similar to AF1615 11 1 H 2 6 

429208 AA447990 Hs.190478 ESTs 2 6 

430733 AW975920 Hs.283361 ESTs 2 6 

25 441720 AI346487 Hs.28739 ESTs 2 6 

418986 AI123555 Hs.81796 ESTs 2 6 

432481 AW451645 Hs.151504 Homo sapiens cDNA FU1 1973 fis, clone HE 2 6 

434338 AW75431 1 gb:CM1-CT0337-141299-068-f07 CT0337 Homo 2.6 

417061 AI675944 Hs.188691 Homo sapiens cDNA FU 12033 fis, clone HE 2 6 

30 410530 M25809 Hs.64173 ATPase, H transporting, lysosomal (vacuo 2 6 

456672 AK002016 Hs.1 14727 Homo sapiens, clone MGC: 16327, mRNA, com 2.6 

425071 NM_013989 Hs. 1 54424 deiod'inase, iodothyronine, type II 2 6 

408868 AW292286 Hs.255058 ESTs 2 6 

451531 AA018311 Hs.1 14762 ESTs 2 6 

35 405822 Target Exon 2.6 

418301 AW976201 Hs.53913 hypothetical protein FU 10252 2 6 

417315 AI080042 Hs.336901 ribosomal protein S24 2 6 

434699 AA643687 Hs.149425 Homo sapiens cDNA FU1 1980 fis, clone HE 26 

443204 AW205878 Hs.29643 Homo sapiens cDNA FLJ13103 fis, clone NT 2 6 

40 405638 Target Exon 2.6 

452542 AW812256 gb:RC0-ST0174-191099-O31-a07 ST0174 Homo 2.6 

403943 C50O0355.-gij45O3225jreflNP_0O0765. 1 1 cyt 2 6 

404535 Z25884 Hs.121483 chloride channel 1 , skeletal muscle {Th 2 6 

402800 NA Target Exon 2 6 

45 449144 AI989503 Hs.233405 ESTs 2.6 

454934 AW846080 Hs.314324 ESTs 2 6 

424717 H03754 Hs.152213 wingless-type MMTV integration site fami 2 6 

428303 AW974476 Hs.183601 regulator of G-protein signalling 16 2 6 

427970 AA418187 Hs,330515 ESTs 2.6 

50 450638 AK001826 Hs.25245 hypothetical protein FU 11 269 2.6 

453034 BE246010 Hs.271468 Homo sapiens mRNA for FLJ00038 protein, 2 6 

455097 AW855802 gb:RC1-CT0279-17O200-O23-408 CT0279 Homo 2.6 

427317 AB028955 Hs.175780 KIAA1 032 protein 2 6 

408875 NM 015434 Hs.48604 DKFZP434B 168 protein 2 6 

5 5 427510 247542 Hs. 179 31 2 small nuclear RNA activating complex, po 2 6 

423201 NM_000163 Hs.125180 growth hormone receptor 2 6 

406271 Target Exon 2.6 

442696 BE566962 Hs.7063 Homo sapiens cDNA: FLJ20913 fis, clone A 2 6 

454018 AW016692 Hs.100855 ESTs 2 6 

60 435420 AI928513 Hs.59203 ESTs 2.6 

434398 AA121098 Hs.3838 sefum-inducible kinase 2 6 

455708 BE069326 gb:QV3-BT0381-170lOO-060-g03 BT0381 Homo 2 6 

439347 W24320 Hs.102941 Homo saptens cONA: FU21 531 fis, clone C 2 6 

407523 X64984 g b:H .sapiens mRNA HTPCRX10 for olfactory 2.6 

65 425101 AA830431 Hs.1 80811 ESTs 2.6 

435153 AA668763 Hs.291939 ESTs 2.6 

409139 AI681917 Hs 3321 ESTs, Highly similar to IRX1 JHUMAN IROQU 2.6 
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4551O0 BE160198 
414612 BE274552 
440283 A1732892 
423025 AA831267 
431473 AA825686 
401440 
403388 NA 
403775 NA 
405037 NA 
407447 AF290544 
420952 AA282067 
435447 AI872932 
440202 AW516211 
445854 AI702885 
421247 BE391727 
414870 N72264 
457411 AW085961 
424676 Y08565 
404443 

452268 NM 003512 
430832 AI073913 
444779 AI192105 
408633 AW963372 
459089 F13036 
447197 R36075 
454111 AW081681 
411165 NMJJ00169 
406922 S70284 
456045 H62943 
413111 BE065837 
423123 NMJH2247 
432201 AI538613 
427032 AF012023 
445417 AK001058 
422225 BE245652 
428330 L22524 
410011 AB020641 
426310 NM„00O909 
419589 AW973708 
437770 AA767881 
442818 AK001741 
414251 AL042306 
428301 AW628666 
431956 AK002032 
455732 BE080908 
458624 AI362790 
428257 BE394723 
418054 NMJXJ2318 
458652 AW375610 
458012 AI424899 
422996 BE091089 
410804 U64820 
457211 AW972565 
440029 AW089705 
448141 AI471598 
409163 AA065081 
431385 BE178536 
414569 AF109298 
439963 AW24752S 
455935 BE158687 
425025 AW953168 
416589 AA652687 
404826 

422938 NMJ01809 
421991 NM_014918 
417404 NML007350 
448516 AW898595 



gb:QV1-HTO413-O1020O-059-b03 HT0413 Homo2£ 

Hs.76578 protein inhibitor of activated STAT3 2.6 

Hs.190489 ESTs 2.6 

Hs. 12244 hypothetical protein FLJ20097 2.6 

Hs.321175 ESTs, Weakly similar to S65824 reverse t 2.6 

NM_021048:Homo sapiens melanoma antigen, 2.6 

C3O01398*:gi|12248917|dbj|BAB2O375.1l (A 2.6 

Target Exon 2.6 

NMJ>21628*:Homo sapiens arachidonate lip 2.6 

gb:Homo sapiens amlnopeptidase mRNA, par 2.6 

Hs.88972 ESTs, Modetately similar to A46010 X-lin 2.6 



Hs.125300 
Hs. 145568 
Hs.102910 
Hs.300670 
Hs.130093 
Hs.151678 

Hs.28777 
Hs.100686 
Hs.147170 
Hs.46677 
Hs .27373 

Hs.269064 
Hs.69089 

Hs/1 54188 

Hs.124027 
Hs.298241 
Hs.173274 
Hs.12680 
Hs/1 18281 
Hs.2256 
Hs.57856 
Hs.169266 
Hs.201925 
Hs.122897 
Hs.8739 
Hs.97689 
Hs.98440 
Hs.272245 

Hs.278639 

Hs.275243 

Hs.83354 

Hs.117102 

Hs.188211 

Hs.66521 
Hs.32399 
Hs.293711 
Hs.197531 

Hs.11090 
Hs.1 18258 
Hs.5733 

Hs.12407 
Hs.96151 

Hs.1594 

Hs.110483 

Hs.82101 



gb:wm72e03.x1 NCl_CGAPJJt2 Homo sapiens 2.6 

ring finger protein 21, interferon-respo 2.6 

ESTs 2.6 

general transcription factor HH, polype 2.6 

KIAA1204 protein 2.6 

ESTs 2.6 

UDP-N-acetyl-alpha^alactosamine:polyp 2.6 

C8001428*:gi[6572242|embiCAB62951.1 j (Z9 2.6 

H2A histone family, member L 2.6 

ESTs, Weakly similar to JE0350 Anterior 2.6 

ESTs 2.6 

PRO2000 protein 2.6 
Homo sapiens mRNA; cDNA DKF2p56401763 (f 2.6 

gb:yh88o01 .s1 Soares placenta Nb2HP Homo 2.6 
ESTs, Weakly similar to T42689 hypotheti 
galactosidase, alpha 

gb:stearoyl-CoA desaturase [human, adipo 
ESTs 

gb:RC2-BTO316-11010O-O12-g12 BT0318 Homo 2.6 
SELENOPHOSPHATE SYNTHETASE ; Human selen2.6 

Transmembrane protease, serine 3 2.5 

integrin cytoplasmic do ma in -associated p 2.5 

Homo sapiens cDNA FU10196 fe, clone HE 2.5 

zinc finger protein 266 2.5 

matrix metalloproteinase 7 (MvP7; uterin 2.5 

PFTAIRE protein kinase 1 2.5 

neuropeptide Y receptor Y1 2.5 

Homo sapiens cDNA aj 13446 fis, done PL 2.5 

ESTs 2.5 

hypothetical protein FLJ10879 2.5 

VASA protein 2.5 

ESTs, WeakJy similar to I38022 hypothefi 2.5 

Homo sapiens cDNA FU11170 fis, done PL 2.5 
gb:QV1-BT0631-28O2004)84-hO7 BT0631 Homo 2.5 



2.6 
2.6 
2.6 
2.6 



KIAA1684 protein; likely homolog of mous 2.5 
S100 calcium-binding protein A6 (calcyd 2.5 
lysyt oxidase- like 2 2.5 
hypothetical protein FLJ13046 simHarto 2,5 
ESTs 2.5 
gb:PM4-8T0724-130400-006-c07 BT0724 Homo 2.5 
Machado-Joseph disease (spinocerebellar 2.5 
ESTs, Weakly simitar to S51797 vasodilat 2.5 
ESTs, Weakly similar to S64329 probable 2.5 
ESTs 2.5 
gb:zm1 3a03.s1 Stratagene pancreas (93720 2.5 
membrane-spanning 4-domains, subfamily A 2.5 
prostate ca ncer associated protein 1 2,5 
platelet-activating factor acety!riydrola 2.5 
gb:CMf>HT0395-2&0100-169-b09 HT0395 Homo2.5 
ESTs 2.5 
Human DNA sequence from clone RP5-1 103G7 2.5 
Target Exon 2 5 

centromere protein A (17kD) 2.5 
KIM0990 protein 2 5 

pleckstrin homology- like domain, family 2.5 
gb:RC1-NN0073-260400-011-g09 NN0073 Homo2.5 
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403356 


NA 


404983 




418282 


AA215535 


427409 


AW467143 


431806 


AF186114 


443367 


AW071349 


421246 


AW582962 


439217 


AF086041 


400925 




404552 


NA 


417168 


AL133117 


418841 


NM 002332 


426853 


U32974 


427738 


NM 000318 


457384 


AA501760 


447128 


AI271898 


454693 


AW813428 


434657 


AA641876 


402077 


NA 


400289 


X07820 


409723 


AW885757 


447020 


T27308 


455068 


AI807894 


431232 


AI024353 


408938 


AA059013 


411571 


AA1 22393 


426504 


AW162919 


428248 


AI1 26772 


408813 


AI580090 


423504 


N80077 


425441 


AA449644 


443066 


AW297921 


443556 


AA256769 


428943 


AW086180 


425320 


U29344 


430388 


AA356923 


423242 


AL039402 


416241 


N52639 


440244 


AI743977 


409239 


AA740875 


452464 


AW500507 


410718 


AI920783 


408877 


AA479033 


445150 


AI446747 


407756 


AA1 16021 


407633 


NMJ07069 


449754 


H00820 


419316 


AA236255 


429118 


H20669 


440331 


AL046412 


449344 


AI640355 


459006 


AW298631 


423165 


AI937547 


411337 


AW837349 


438290 


AA843719 


406414 




424498 


AB033043 


443464 


BE548446 


424856 


AA347745 


440304 


BE1 59984 


409045 


AA635062 


422648 


D86983 


428819 


AL1 35623 


412520 


AA442324 


430602 


D13752 


408031 


AA081395 


403133 





ENSP00000251525*:Hypolhetical protein Kl 2.5 
ENSP00000252242*:Keratin, type It cytosk 2.5 
Hs.98133 ESTs 2 5 

Hs.135411 actin related protein 2 5 

Hs 270737 tumor necrosis factor (Ifgand) superfami 2 5 

Hs.215937 ESTs 2 5 

Hs 102897 CGI47 protein 2 5 

Hs.42975 ESTs 2 5 

Target Exon 2 5 

ENSP00000220888*:ZINC FINGER TRANSCRIPTI2.5 
Hs 81376 Homo sapiens mRNA; cDNA DKFZp586L1121 (f 2 5 
Hs 89137 low density lipoprotein-reJated protein 2 5 

Hs 172777 baculoviral IAP repeat-containing 4 2 5 

Hs 180612 peroxisomal membrane protein 3 (35kD, Ze 2 5 
Hs 15806 Homo sapiens mRNA; cDNA DKFZp434 H20l9 (f 2 5 
Hs 1 64866 cyclinK 2 5 

gb:MR3-ST0192-O1020O-210-cO5 ST0192 Homo 2 5 
Hs 191840 ESTs 2 5 

Target Exon 2 5 

matrix metalloproteinase 10 (MMP10; str 2 5 
ESTs 2 5 

hypothetical protein FLJ11046 2 5 



Hs.2258 
Hs.257862 
Hs 16986 
Hs.47274 
Hs 131755 
Hs 22607 
Hs 70811 
Hs. 170160 
Hs 40479 
Hs 48295 
Hs 24792 
Hs 193063 
Hs.255703 
Hs 94949 
Hs. 37636 
Hs.83190 
Hs 240770 
Hs.125783 
Hs 32683 
Hs 205144 
Hs 44307 
Hs.192619 
Hs 191435 
Hs 130315 
Hs 338704 
Hs 38260 
Hs 37189 
Hs 30977 
Hs 298419 
Hs 35406 
Hs 202151 
Hs 312691 
Hs 27721 
Hs. 124915 

Hs 122341 

Hs.1 49377 

Hs.5167 

Hs.9521 

Hs 125395 

Hs.50094 

Hs.1 18893 

Hs.193914 

Hs.795 

Hs.1 84927 

Hs.42173 



Homo sapiens mRNA; cDNA DKFZp564B176 (fr 2 5 

hypothetical protein FLJ 14298 2 5 

ESTs 2 5 

hypothetical protein FLJ20516 2.5 

RAB2, member RAS oncogene family-like 2 5 

ESTs 2 5 

RNA helicase family 2 5 

chromosome 12 open reading frame 5 2 5 

Homo sapiens cDNA FLJ 14201 fis, clone NT 2 5 

ESTs 2 5 

methy Imalonyl-CoA epime rase 2 5 

ESTs, Weakfy similar to KIAA1392 protein 2.5 

fatty acid synthase 2 5 

nuclear cap binding protein subunit 2,2 2 5 

DEME-6 protein 2 5 

ESTs 2.5 

ESTs 2.5 

ESTs, Moderately similar to I38022 hypot 2 5 

KIAA1600 protein 2 5 

ESTs 2 5 

ESTs, Weakly similar to A47582 B-cell gr 2 5 

olfactory receptor, family 7, subfamily 2.5 

ubiquitin specific protease 18 2 5 

similar to rat HREV107 2.5 

ESTs, Weakly similar to B34087 hypotheti 2 5 

ESTs 2 5 

ESTs, Highly similar to unnamed protein 2 5 

ESTs 2 5 

ESTs 2.5 

Wolf-Hirscbhom syndrome candidate 1 -lik 2 5 

hypothetical protein MGC2601 2.5 

gb:QV2-LT0038-270300-108-d12LT0038Homo 2 5 

ESTs 25 

C5000506*:gi|124941(sp|P18614|ITA1_RAT t 2.5 

hypothetical protein DKFZp761L0424 2.5 

Homo sapiens mRNA; cDNA DKFZp434F152 (fr 2.5 

ESTs, Weakly similar to ZN43_HUMAN ZINC 2 5 

ESTs 2 5 
Homo sapiens mRNA; cDNA DKFZp434O0515 (f 2.5 

Melanoma associated gene 2.5 

K1AA0575 gene product 2.5 

H2A histone family, member O 2.5 

cytochrome P450, subfamily XI B (steroid 2.5 

Homo sapiens cDNA FU10366 fis, clone NT 2.5 

Target Exon 2.5 
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413189 BE070231 gb:QV4-BT0407-260100-087-f1 2 BT0407 Homo 2.5 

400346 AB041269 Hs.272263 Homo sapiens mRNA for keratin 19, partia 2.5 

435509 AM58679 Hs.181915 ESTs 2.5 

458145 AI239457 Hs.130794 ESTs 
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TABLE 19A 

Table 19A shows the accession numbers for those pkeys lacking unigenelD's for Table 19. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

407647 1007366J AW8601 58 AW862385 AW860159 AW862386 AW862341 AW821869 AW821893 AW062660 AW062656 
407980 103087J AA046309 AI263500 AA046397 

408254 1049346J AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AW807083 AW845786 AW845801 
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424 
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW607205 AW807505 BE141574 AW807390 
AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AW807098 
AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AWB07166 AW807501 
AW807120AW807168AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800 
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141 
AW807522 AW807487 AW807514 AW807142 AW807232 AW807379 AW8071 14 AW807518 AW807199 AW80721 1 
AW807498 AW807086 AW807492 AW807218 AW807082 AW807525 AW807493 AW807523 AW807087 AW845784 
AW807037 AW807 128 AW807080 AW8071 1 8 AW845807 AWB07524 AW845803 AW807249 AW845795 AW8071 60 
AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103 
AW845870 AW177099 AW177101 AW807528 AW807336 AW807038 AW177100 AW807411 AW807088 AW845865 
AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521 
AW807488 AW807385 AW807355 AW807223 AW807155 

409163 1 10418.1 AA065081 AA075017 AA084791 AA071015 AA081560 AA071459 AA545727 AA083100 M085366 AA115845 AA075457 
AA064704 AA082878 AA075742 AA069 1 62 

409695 1 14876.1 AA296961 AA296889 AA076945 AA077528 AA077497 

4 1 0534 1 207247.1 AW905 138 AW753008 R1 381 8 Z43519 

410672 1214882,1 AW794600 AW794730 

410784 1221005.1 AW803201 BE079700 BE062940 

410785 1221055.1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 
410835 1 223785.1 AW806906 AW80691 5 AW866460 AW666475 AW866462 AW866448 AW866372 AW866604 

411050 1 230330.1 AW814902 BE156656 BE156667 BE156590 BE156441 BE156447 

411086 1231500.1 BE070800 AW875226 BE149115 

411093 1231970.1 BE067650 AW817053 

411111 1232669.1 AW818127 AW818161 R09719 

411171 1234393.1 AW820260 AW820332 R94406 

411337 1239217.1 AW837349 AW837355 AW882717 

411514 1248638.1 AW850178 AW850233 AW850445 AW850446 

41 1670 1253680.1 AW856552 AW861101 AW856574 AW861099 A W861 100 AW 8 56573 AW856576 AW856562 
41 1905 12651B1J BE265067 BE264978 AW875420 
412102 1277395.1 H56435 H56572AW892929 
412209 1283610_1 AW901456 AW901450 AW901441 
412248 1285000.1 BE176480 AW903298 AW903313 

413043 1 346556.1 BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678 

4131 11 1349546.1 BE065837 BE065805 BE065799 BE065818 BE065839 BE065831 BE065894 BE065789 BE065792 

413189 1 352723 1 BE070231 BE070229 BE070255 

413221 1353887.1 BE161151 BE162495 BE161002 BE072205 BE160989 BE162482 
413499 1373910J BE1 44884 H97942 

413708 1384140.1 BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE15881 1 BE 158700 BE 158741 BE1 58683 BE 158685 
414210 1426051 1 BE383592 BE261671 
414596 1465004J BE386870 Z41986 HO8501 
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414605 
415747 
416173 
417742 
417974 
418536 
419536 
420854 
422156 

422996 
423833 
423641 
423945 
425201 
426650 
426678 
430264 
430785 
431676 
433687 
434338 
434469 
435447 
437152 
437854 
439031 
439255 
444910 
445432 
446922 
447197 
448420 
448516 
450522 
450736 
451024 
451067 
451340 
452542 
452564 
453472 
454307 
454359 
454545 
454693 
454714 
455047 
455092 
455097 
455100 
455431 
455511 
455609 
455651 

455685 
455700 
455706 
455732 
455838 
455935 
455945 
456207 
456482 
458094 
458673 



1465790.-1 
155189.1 
1574973J 
1696282,1 
171 237 J 
177402.1 
185668.1 
197072.1 
212379.1 

223666.1 
232451.1 
232507.1 
233566.1 
247933.1 
270283.1 
273265.1 
315008.1 
323486.1 
336411.1 
373061.1 
383982.1 
387447.1 
406400.1 
43386.1 
444 18J 
46798.1 
470321.1 
624951J 
63943.1 
69865.1 
711623.1 
76273.1 
766241.1 
837264J 
844652.1 
85565.1 
85759.1 
86640.1 
921410 J 
92227.1 
968371.1 
1106070, 
1130674 
1223779. 
1229132. 
1230493. 
1250536. 
1252971, 
1253130. 
1253334. 
1289854. 
1321229.' 
1337548. 
1348732. 

1350393. 
1351264. 
1352232. 
1353874. 
1374605. 
1384144. 
1385588. 
165078.- 
192289. 
47311.1 
679507.1 



BE390440 

AA381209AA381245AA167683 
R52782R17313H24192 R19876 
R64719 Z44680R12451 
AA210765T95700 H94407 
AW749855 AA225995 AW7 50208 AW750206 
AA603305 AA244095 AA244183 

AW835572 AW385512 AI334966 W32951 H62656 H53902 R88904 AW835732 

BE091089 BE091 123 AA319959 

AW503329 N46610 AA331571 

AW753967 AA370795 AA331630 AW962550 

AA410943 AW948953 AA334202 AA332882 

AA352111 AW962247 AA429695 

M382814AA402411 AA412355 

BE069341 AW748403 AL044891 AI908240 AA393080 

AA470519 BE303010 BE302954 BE384120 

Z30201 AA486132 T72025 

AI685464 AW971336 M513587 AA525142 

M743991 AA604852 AW272737 

AW754311 AA630185AW803285 

AA634806 C18732 M729161 AA729860 

AI872932 AA682306 BE220163 W88695 T81307 H91447 

AL050027 BE089051 

AL1 19723 AL119874 AI909018 U50537 

AF075079H48601 H48795 

BE164500 AA632198 BE164502 

AI201849 BE069007 AW946544 

AV653771 BE089370 

BE175605 Z43529 F06610 BE175602 AV661027 

R36075 A1366546 R36167 

BE623004AA380669 BE263627 BE246433 

AW898595 AW898588 AW898590 AW898663 AW898592 AI525093 

AI698839AI909260 AI909259 

AW970060 AI732366 AI792313 AW839644 

AA442176AA259181 

BE172186 AA059279 AA020815 AA013437 
AW936273 AW340350 AA017208 
AW812256 AW812257 AI906423 AI906422 
AA026777 N50065 R09961 N54721 
AL037925 AL037931 AL037957 
AW855717 AW362452 AW362443 
N71277AW390764 

AW806899 AW865451 AW866393 AW866297 AW817869 
AW813428 AW813444 AW813367 AW813368 AW813429 AW813424 
AW815098BE154843 BE154831 
AW852530 AW852527 AW852526 

SaKaS AW855806 AW855796 AW855808 AW855793 AW855807 
BE160198 AW935898 T11520 AW935930 AW856073 AW861034 
AW938484 BEO0 1245 BE 001 190 

^llSm HeK BE011627 BE011679 BE011699 BE011678 BE011696 BE011675 BE01 1622 BE011635 
IS S 5 BE06 857 BE064856 BE064977 BE064960 BE064860 BE06481 5 BE064957 BE064804 
B S IS BE064806 BE064796 BE064818 BE064975 8E064619 BE064810 BE064668 BE065059 
BE066976 BE066928 BE066927 

BE068115 BE068104 BE0681Q2 BE068096 BE068103 BE068154 BE068198 

BE069326 BE069290 BE069352 

BE080908 BE072258 BE072190 BE072236 

BE145808 BE145807 BE181883 

BE158687 BE158688 

BE160636BE1 60606 BE1 60703 

AA193450 

AA485224AA287308 AA258121 
AF086325W72956W73221 AA219112 
N99626 A1302701 
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TABLE 19B 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al M Nature (1999) 402:489495. 
Strand; Indicates DNA strand from which exons were predicted. 

NLposirjon: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Ntjjosttton 


400555 


9801191 


Minus 


134by4-i J48U 


400608 


ADO 1C £ c 


Minus 


fierce 07CCO 


400610 


Aft 4 

9887671 


Minus 


1 17606-1 17928,124040-1 ^4147 


400925 


7651921 


Plus 


38183-38391,43900-44086 


401045 


8117619 


Plus 


90044-901 84,91 1 1 1 -9 1345 


401049 


7232177 


Plus 


149157-150692 


401093 


8516137 


Minus 


22335-23166 


401256 


9796573 


M.nus 




401283 


9800093 


Minus 


4725647456 


401326 


9212516 


Minus 


226246-227505 


401418 


7452889 


Minus 


124865-125075 


401451 


6634068 


M.nus 


119926-121272 


401458 


9187886 


Plus 


76485-77597 


401497 


7361770 


Plus 


92607-92813 


401508 


7534110 


Minus 


110779-110983 


401575 


7229804 


M;nus 


76253-76364 


401747 


9789672 


Mmus 


118596-118616,119119-119244,1 19609-119761,120422-120990,130161-130381, 130468-130593,131097- 








131258,131866-131932,132451-132575,133580-134011 


401781 


7249190 


M.nus 


83215-83435,83531-83656,83740-83901,84237-84393.84955-85037,86290-86814 


401785 


7249190 


Minus 


165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942 


401793 


7263888 


Mmus 


102945-103083 


401987 


4406829 


Mmus 


72893-73021,76938-77049 


402077 


8117414 


Plus 


65014^65195 


402109 


8131678 


M:nus 


171722-171859,173197-173303 


402184 


8576001 


Mmus 


112844-112986,113505-113636 


402376 


9625329 


Mmus 


21753-22385 


402421 


9796341 


Mmus 


4660946662.46758-46811,86293-86346,89776-89829,90048-90101,102817-102924 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Mmus 


81747-82094 


402696 


7328818 


Minus 


23600-23731 


402765 


9367757 


Plus 


109588-109726 


402800 


6010175 


Plus 


4392144049,4618146273 


402820 


6456853 


Mmus 


82274-82443 


402892 


8086844 


Minus 


194384-194645 


403133 


7331427 


Plus 


38314-38634 


403356 


8569930 


Plus 


92839-93036 


403388 


9438331 


Plus 


112733-113001,114599-114735 


403426 


9719529 


Minus 


157156-158183 


403585 


8101208 


Minus 


131266-131769 


403593 


6862550 


Minus 


62554-62712,69449-69602 


403637 


8671936 


Minus 


142647-142771,145531-145762 


403639 


8671948 


Plus 


113234-113326,115186-115287,119649-119786 


403677 


7331517 


Minus 


55008-55083,62860-63051 


403775 


7770580 


Minus 


102247-102326,103095-103148 


403943 


7711864 


Plus 


100742-100904,101322-101503 
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404091 7684554 


Minus 


82121-83229 


404097 7770701 


Plus 


55512-55781 


404142 9856692 


Minus 


80316-80459 


404253 9367202 


Minus 


55675-56055 


404274 9885189 


Plus 


104127-104318 


404285 2326514 


Plus 


32282-32416 


404360 9858450 


Minus 


122873-122966,151324-151469,153093-153253 


404440 7528051 


Plus 


80430-81581 




Mini iC 


87198-87441 


404552 7243881 


Plus 


19854-20010 


40.4561 Q7Q5QR0. 


Minn*; 


69039-70100 


404580 6539738 


Minus 


240588-241589 


404721 9856648 


Minus 


173763-174294 


404826 6572184 


Plus 


47726-46046 


404983 4432779 


Minus 


51178-51374,52000-52173 


405037 7543748 


Minus 


127374-127578 


405041 7547195 


Plus 


121230-121714 


405095 8072599 


Plus 


138877-139066 


405153 9965565 


Minus 


175317-175500 


405196 7230083 


Minus 


135716-135851 


405232 7249042 


Plus 


125904-126063 


40574R 7259728 


Plus 


637-777 


4n5^fi 6094635 


Plus 


33267-33553 


405394 6624123 


Minus 


31900-32373 


4TK4fif) 7fift45fi.Q 


Minus 


52223-52389 


405494 8050952 


Minus 


70284-70518 




Plus 


124361-124520,124914-125050 


405600 575755^ 


Minn*; 


42614-43010,43583-43763,44863-45033.46429-46554,4781 5^1801 8,49961-50 153^1 624-5 1727,51823-51959.52702- 


52918,55469^55601,57111-57307,58169-58296,60215-60332,61482-61727 




Plus 


1 99260-199372, 1 99826-1 99929 




Minus 


53624-53759 


405718 9795467 


Plus 


113080-113266 


405822 6273498 


Minus 


15466O-154974.155203-155379 


405848 7651809 


Minus 


28135-28244 


405873 6758747 


Minus 


32129-32764 


405906 7705124 


Minus 


10835-11059 


405917 7712162 


Minus 


106829-107213 


405925 6758795 


Plus 


129935-130282 


405953 7960374 


Minus 


65101-65574 


406069 9117732 


Plus 


68880-69374 


406151 7144806 


Minus 


94087-94285 


406153 9929734 


Minus 


12902-13069 


406182 5923650 


Minus 


28256-28935 


406271 7534217 


Plus 


36179-36692 


406291 5686274 


Plus 


9562-9867 


406348 9255985 


Minus 


71754-71944 


406414 9256407 


Plus 


49593-49850 


406446 9454509 


Minus 


1 16424-1 16527,1 18721-1 18859,121 187-121 364 


406504 7711360 


Minus 


107068-107277 


406554 7711566 


Plus 


106956-107121 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
SURFACE PROTEINS 



10 



15 



20 



Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
tissue level was set to the 85 th percentile value amongst 144 non-malignant tissues, and the 
96 th percentile value amongst the 73 breast cancer specimens was greater than or equal 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization (e.g. ig, fh3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Gen bank aocession number 

UnigenelD: Unigene number 

Pred.ProtDomains: Predicted Protein Domains 

Unigene Title: Unigene gene title 

R1 ; Ratio of 93^ percentile of tumor to 85* percentile of normal body tissue 



25 



30 



35 



40 



45 



50 



55 



Pkey 


ExAccn 


UnigenelD 


Pred.ProtDo mains 


UnigeneTrtle 


R1 


408591 


AF015224 


Hs.46452 


SS,Uteroglobin f SS,Uterog!obin 


mammaglobin 1 


168.6 


400291 


AA401369 


Hs.190721 


TM 


ESTs 


73.2 


449746 AI668594 


Hs.176588 


,SS,p450 


ESTs, Weakty similar to CP4Y_HUMAN CYTOC 


65,7 


407277 AW1 70035 Hs 326736 


TM 


Homo sapiens breast cancer antigen NY-BR 


57.6 


400292 AA250737 


Hs 72472 


death,ZU5,TM,ActMn recp.pkinase, 


BMP-R1B 


55.9 


424735 


U31875 


Hs 272499 


,SS,TM 


short-chain alcohol dehydrogenase family 


53.8 


425878 


BE069341 




TM 


gb:GV3-BT0381-27010XH)73^c08BT0381 Homo 


50.3 


428848 


NM_000230Hs 194236 


SS^eptin.SS.Leptin, 


leptin (murine obesity homobg) 


40.8 


407178 AA195651 


Hs 104106 


,SS,Dihydroorotase, 


ESTs 


39.3 


408000 


L11690 


Hs.620 


Plectin _repeatSH3 I spectrin ,SS ,Ptectln_r 


bullous pemphigoid antigen 1 (230/240kD) 


37.3 


427585 


D31152 


Hs 179729 


SS.CIq.Collagen.SS.dq.Collagen, 


collagen, type X, alpha 1 (Schmid metaph 


35.2 


429441 


AJ224172 


Hs 204096 


,SS, Uteroglobin, 


lipophilin B (uteroglobin family member) 


30.0 


450375 


AA009647 


Hs.8850 


,SS ) TM,disintegrin,Pep_M12Bj>ropep,Repro a disintegrin and metalloproteinase doma 


257 


420931 


AF044197 


Hs 100431 


SS.IL8.SS 


small inducible cytokine B subfamily (Cy 


25.2 


422109 


S73265 


Hs 1473 


SS,Bombesin,SS 


gastrin- releasing peptide 


24.8 


445730 


AI624342 


Hs 170042 


,SS,TM, Cation efflux 


ESTs 


24.1 


451110 


AI955040 


Hs 265398 


SS 


ESTs, Weakty similar to transformation- r 


24.0 


400297 


AI127076 


Hs 334473 


TM 


hypothetical protein DKFZp56401278 


23.8 


420813 


X51501 


Hs 99949 


SS.SS 


prolactin-induced protein 


22.8 


452744 


AI267652 


Hs 30504 


,SS,TM,GNS1_SUR4,cNMP_binding t Rlla 


Homo sapiens mRNA; cDNA DKFZp434E082 (fr 


22.6 


424634 


NM_003613Hs.151407 


ig,tsp_1,SS,AAA 


cartilage intermediate layer protein, nu 


217 


420757 


X78592 


Hs 99915 


hormone rec, Androgen recep,zf-C4, 


androgen receptor (dihydrotestosterone r 


21.7 


424399 AI905687 


Hs 2533 


SS 


aldehyde dehydrogenase 9 famiry, member 


20.3 


447350 


AI375572 


Hs 172634 


.pkinase, 


ESTs 


192 


456207 


AA193450 




,SS,TM,p450,p450 


gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 


183 


431448 


AL137517 


Hs 334473 


TM 


hypothetical protein DKFZp56401278 


182 


427217 


AA399272 


Hs 144341 


SS 


ESTs 


182 


456938 


X52509 


Hs.161640 


,SS,TM,aminotran_L2 J Cadherin_C_term,cad tyrosine aminotransferase 


181 


435496 


AW840171 Hs 265398 


SS 


ESTs, Weakly similar to transformation- r 


17 9 


402578 






SS,p450,SS,TM,p450 


C1001134:gi|2117372|pirj|l65981 fatty ac 


17.8 


453160 


AI263307 


Hs 239884 


SS 


H2B histone family, member L 


178 


422505 


AL120862 


Hs.124165 


SS 


programmed cell death 9 (PDCD9) 


177 


444342 


NM_014398Hs.10887 


Lamp.SS.TM.Lamp, 


similar to tysosome-assoctated membrane 


17 5 
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60 



65 



449765 
428227 
425692 
424001 
448595 
449446 
418007 
418994 
453596 
452401 
446591 
419296 
452838 
422805 
448390 
447342 
411869 
443709 
459587 
442580 
400289 
411598 
415263 
433905 
407276 
443348 
421037 
424086 
400295 
452461 
427365 
433365 
409269 
432596 
408771 
432912 
447033 
421155 
424905 
425398 
438167 
459583 
423945 
439820 
402606 
445263 
430217 
447164 
431385 
423887 
415385 
425704 
429859 
425523 
428368 
418912 
422026 
451952 
438199 
400608 
413472 
432374 
402408 
445537 
451621 
405654 
434988 



N92293 Hs. 206832 
AA321649 Hs.2248 



D90041 
W67883 



Hs.1 55956 
Hs. 137476 



AB014544 Hs.21572 
D60730 Hs.57471 
M13509 Hs.83169 
AA296520 Hs.89546 
AA441838 Hs.62905 
NM_007115Hs.29352 
H44186 Hs. 15456 
AA236115 Hs. 120785 
U65011 Hs.30743 
AA436989 Hs.121017 
AL035414 Hs.21068 



SS 

SSJL8, 

,SSAcetyltransf2, 
,pkinase, 

LRRCT,LRR,SS,LRRCT.serine_carbpept 
SS 



Hs.19322 
Hs.23439 
Hs.134662 

Hs.1 30239 
Hs.2258 
BE336654 Hs.70937 
AA948033 Hs.1 30853 
AA706910 Hs.112742 
AI951118 Hs.326736 
AW873596 Hs.182278 
AI684808 Hs.197653 



AI1 99268 
W20027 
AI082692 
AA031956 
AI733682 
X07820 



AI351010 

W72838 

N78223 

AI873274 

AF026944 



Hs.102267 

Hs.2533 

Hs.108106 

Hs.190721 

Hs.293797 



AA576953 Hs.22972 
AJ224741 Hs.278461 
AW732573 Hs.47584 
BE007371 Hs.200313 
AI357412 Hs.157601 
H87879 Hs.102267 
NM_002497Hs.153704 
AL049689 Hs.156369 
R28363 Hs.24286 
AI907673 
AA410943 

AL360204 Hs.283853 



H57646 
N47863 
AF026941 



Hs.42586 
Hs.336901 
Hs.17518 



BE178536 Hs.1 1090 
AL080207 Hs.134585 



R17798 
U79293 



Hs.7535 
Hs.159264 



NM_007050Hs,225952 
AB007948 Hs.1 58244 
BE440042 Hs.83326 
NMJXX)685Hs.89472 
U80736 Hs.1 10826 
AL120173 Hs.301663 
AW016531 Hs 122147 

BE242870 Hs.75379 
W68815 Hs.301885 
NA 

AJ245671 
AI879148 
NA 

AW18055 



Hs.12844 
Hs.26770 

Hs. 161 160 



ESTs, Moderately similar to ALU8JHUMAN A 
small inducible cytokine subfamily B (CX 
N-acetyltransferase 1 (arylamine N-acety 
paternally expressed 10 
KIAA0644 gene product 
ESTs 



SS,hemopexin,Peptjdase_M10,SS,Pept)dase_nnatrix metalloproteinase 1 (MMP1; inters 
Ss!leclin_c 1 sushi 1 EGF,SS 1 EGF ) lectin_c,su selectin E (endothelial adhesion motecul 



SS 

,SS,CUB t Xlink, 

PDZ.SS 

SS 

SS.SS 

histone.SS , histone, histone 
SS 

.SS.Iipocalin 
,SS,Peptidase_M1, 
.SS.TM.SNF 
.SS.LIM, 
SS 



hypothetical protein FLJ 14834 
tumor necrosis factor, alpha-induced pro 
PDZ domain containing 1 
ESTs 

preferentially expressed antigen in mela 
H2A histone family, member A 
hypotheticaJ protein 

Homo sapiens, Simitar to RIKEN cDNA 2010 

ESTs 

ESTs 

gb:zk1 5e04 .s1 Soares_pregnant_uterus_N bH 
ESTs 



herTwpexin^eptklase_M10 l SSPeptidase_M10matnx metalloproteinase 10 (MMP10; str 



histone.SS.histone.histone 
.SS.histone.hlstone, linker_h istone 
.SS.RfcosomalJJAe, 
TM 

.SS.DENN 
SS 

,SS,Lysyl_oxidase 
SS 

( SS,G9a,PHD, 
TM 

,SS,TPR 

SSJM l UPF0016,SS t TM,UPF0016 
SS,EGF,vwa,SS,TM,vwa, 
TM,K_tetra,ion_trans 1 SS,TM,K_tetra I ionJ 
,SS,TM,Folate_carrier 
SS 



H3 histone family, member A 
ESTs 
ESTs 

Homo sapiens breast cancer antigen NY-BR 
calmodulin 2 (phosphorylase kinase, delt 
programmed cell death 9 (PDCD9) 
lysyl oxidase 

aldehyde dehydrogenase 9 family, member 

transcription factor 

ESTs 

ESTs 

hypothetical protein FLJ13352 
matrilin 3 

potassium voltage-gated channel, delayed 

ESTs 

ESTs 



SS,Lysyt_oxidaseAldose_epim ( Epimerase,S lysyl oxidase 
pkj'nase,SS t TM,pkfnase l poryprenyl_synt MIMA (never in mitosis gene a)-related k 



SS 

, SS ,TM 7tm_1 ,p450 ,rrm 
.pkinase, 

death ,2U5,TM,Activin_recp,pkinase 

SS 

SS 

.SS.Acyttransferase, 



hypothetical protein similar to tenascin 
ESTs 

gb:!L-BT152-080399-004 BT152 Homo sapien 
gb:zt32h03.r1 Soares ovary tumor NbHOT H 
Homo sapiens mRNA full length insert cDN 
NM_024626:Homo sapiens hypothetical prat 
KIAA1560 protein 



l SS l RNA_pol_A,RNAjx)l_A2,RibosomaLS24e ribosomal protein S24 



.TMJBR 
.SSJM 

,SS p TM,BRCT,aiiKABC_tfan ( ABC_tran 
,SS,Fork_head, 
SS 

, SS ,TM, Y_phosphatase.MAM.fn3 , 
, SS, larnin ir^B Jamfnin_EGF.Samin in_Nterm 



Homo sapiens cig5 mRNA, partial sequence 
membrane-spanning 4-domains, subfamily A 
DKFZP434G232 protein 
COBW-like protein 

Human clone 23948 mRNA sequence 
protein tyrosine phosphatase, receptor t 
KIAA0479 protein 



SS,PeptWase_M10,hemopexin,SS,Peptidase_matrix metaltoproteinase 3 (strometysin 



SS.TMytmJ.SS.TM^tm.l, 
SS 

.SS.pkinase, 
.SS.ArfGap, 
SS,TM t SS,TM 
SS 
SS 

.SS.cartLanhydrase 

t SS,TM,ras 
SS.IipocalinJipocalln, 
BTB.SS 
SS 



angiotensin receptor 1 
trinucleotide repeat containing 9 
ESTs 
ESTs 

C10001899:gi|7508633|pir|fT25392 hypothe 
solute carrier family 1 (glial high affi 
Homo sapiens cDNA FU 11346 fe, clone PL 
NM_030920*»omo sapiens hypothetical pro 
EGF-like-domain. multiple 6 (EGFL6) 
fatty acid binding protein 7, brain 
C12001521.gi|7513934|pir||T31081 cca3 pr 
ESTs 



17.3 

17.0 

16.7 

16.5 

16.3 

16.2 

15.7 

15.5 

15.5 

15.0 

14.9 

14.3 

14.7 

14.3 

14.2 

14.2 

13.9 

13.7 

13.7 

13.5 

13.5 

13.3 

13.2 

13,1 

13.1 

13.0 

12.9 

128 

12.7 

12.5 

12.4 

12.3 

12 0 

11.9 

11.9 

11.9 

11.8 

11.8 

11.7 

11.6 

11.5 

115 

11.4 

11.4 

11.3 

11.2 

11.1 

11.1 

11.1 

10.9 

10.9 

10.7 

10.4 

10.3 

10.3 

10.3 

10.3 

103 

102 

10.1 

10.0 

9.9 

9.8 

9.7 

9.6 

9.6 

9.6 
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416220 


N49776 


Hs.170994 


,SS,TM 


hypothetical protein MGC 10946 


9.5 


431808 


M30703 


Hs.270833 


SS,TM,EGF,SS 


amphiregulin ( schwa nnoma-de rived growth 


9.5 


414142 


AW368397 


Hs. 150042 


,SS,UDPGT 


Homo sapiens cDMA FLJ 14438 fis, done HE 


9.4 


400298 AA032279 


Hs.61635 


TM 


six transmembrane epithelial antigen of 


9.4 


418601 


AA279490 


Hs.86368 


SS t TM I calreticulin,SS ) TM,calreliculin, 


calmegin 


9.4 


415539 


AI733881 


Hs.72472 


death ,ZU5,TM t Activin_recp f pklnase, 


BMP-R1B 


9.4 


421451 


AA291377 


Hs,50831 


TM 


ESTs 


9.3 


429432 


A1678059 


Hs,202676 


SS 


synaptonemal complex protein 2 


9.3 


442441 


AI820662 


Hs 129598 


SS 


ESTs 


9.1 


426429 


X73114 


Hs. 169849 


,SS,TM,fr»3,ig, 


myosin -binding protein C, slow-type 


9.1 


406687 


M31126 


Hs, 272620 


SS,PeptidaseJ410,herTK)rjexin,SS, Peptidase jriatrix metallo proteinase 11 (MMP11; stro 


9.1 


448693 


AW004854 


Hs.228320 


SS 


hypothetical protein FLJ23537 


9.1 


419948 AB041035 


Hs 93847 


Ferric redudJM,Ferric_reduct, 


NM_016931 :Homo sapiens NADPH oxidase 4 ( 


9.1 


426214 


H59846 


Hs. 128355 


SS 


ESTs, Moderately similar to ALU7 HUMAN A 


9.0 


427718 AI798680 


Hs 25933 


.SSTM.histone.Secl.histone, sugar tr 


ESTs 


89 


414812 


X72755 


Hs.77367 


SS,IL8,SS,IL8 


monokine induced by gamma interferon 


88 


400285 


NA 




JM.ABC tran^BC membrane, 


Eos Control 


88 


422330 


D30783 


Hs 115263 


SS,TM,EGF,SS t TM 


epiregulin 


88 


416182 


NM_004354Hs 79069 


cyclin,SS 


cycJin G2 


88 


420077 AW512260 Hs.87767 


SS 


ESTs 


87 


452281 


T93500 


Hs 28792 


,SS,TGF4)eta,TGFb_propeptide, 


Homo sapiens cDNA FLJ 1 1041 lis, clone PL 


87 


434531 


AA642007 


Hs 116369 


SS 


ESTs 


86 


408380 


AF1 23050 


Hs 44532 


,SSTM,ubiquitin,7trn 3,ANF_receptor,sush 


diubtquitin 


86 


443788 AI732643 


Hs.144151 


TM 


ESTs 


86 


411078 


AI222020 


Hs 182364 


SS,SS 


CocoaCrisp 


85 


445495 


BE622641 


Hs.38489 


SS,SS,ENTHLLWEQ,ENTH ,LLWEQ,DNA_mis_reESTs, Weakly similar to 138022 hypotheti 


85 


433426 


H69125 


Hs 133525 


,SS,TM 


ESTs 


85 


424871 


NMJ)04525Hs. 153595 


SS,EGF,ldl recepLa,WI_recepLb ) SS,TM 1 E 


low density lipoprotein-related protein 


8.4 


426215 


AW963419 Hs 155223 


SS 


stanniocalcin 2 


84 


409045 


AA6 35062 


Hs 50094 


TM 


Homo sapiens mRNA; cDNA DKFZp434O0515 (f 


8.4 


435525 


AI831297 


Hs.123310 


TM 


ESTs 


8.3 


409203 AA780473 


Hs.687 


SS,p450,SS,p450 


cytochrome P450, subfamily IVB, polypept 


8.3 


424902 


NMJ)03866Hs.153687 


SS,SS 


inositol polyphosphate-4-phosphatase, ty 


83 


431725 


X65724 


Hs2839 


SS,Cys_knot,SS 


Nome disease (pseudoglioma) 


8.3 


418092 


R45154 


Hs. 106604 


) death f ZU5,pkjnase,Activin recp, 


ESTs 


83 


439840 


AW449211 


Hs 105445 


SS 


GDNF family receptor alpha 1 


82 


427811 


M81057 


Hs 180884 


SS^n_carbOpept,Propep„M14,SS,Propep_M14cart)oxypeptjdase B1 (tissue) 


82 


420807 


AA280627 


Hs 57846 


SS,cpn10 


ESTs 


8.2 


426320 


W47595 


Hs 169300 


SS.TGF-beta.TGFb _propeptkJe,SS 


transforming growth factor, beta 2 


82 


447078 


AW885727 Hs 301 570 


,SS,kazal, 


ESTs 


8.1 


415786 AW419196 Hs.257924 


SS 


hypothetical protein FU13782 


8.1 


410102 


AW248508 Hs 279727 


SS 


Homo sapiens cDNA FLJ14035 fis, clone HE 


8.0 


404347 






SS 


Target Exon 


8.0 


433687 


AA743991 




TM 


gb:ny57g01.s1 NCI CGAP Pr1 8 Homo sapiens 


8.0 


421373 


AA808229 


Hs 167771 


,SSJMPDH_C,1MPDHJJ,CBS 


ESTs 


8.0 


422634 


NM_016010Hs.1 18821 


SS 


CGI-62 protein 


7.9 


453310 


X70697 


Hs 553 


TM.SNF.SSJHSNF, 


solute earner family 6 (netirotransmitte 


7.9 


435957 


N39015 


Hs 190368 


,SS,TM 


ESTs 


7.8 


407771 


AL138272 


Hs 62713 


,TM ( cpn60_TCP1 t Sema, 


ESTs 


7.8 


443646 


AI085198 


Hs 164226 


,TSPN,vwc,tsp_1 J EGF,thiored, 


ESTs 


7.8 


446142 


AI754693 


Hs 145968 


( TM ( cadherin,Cadherin_C_term, 


ESTs 


7.7 


444649 


AW207523 


Hs 197628 


,SS,rrm, 


ESTs 


7.6 


435147 


AL133731 


Hs4774 


,TM,SDF,UPAR_LY6, 


Homo sapiens mRNA; cDNA DKFZp761C1712 (f 


7.6 


439138 


AI742605 


Hs 193696 


TM 


ESTs 


7.6 


429220 


AW207206 


Hs 136319 


SS 


ESTs 


7.6 


428804 


AK0O0713 


Hs 193736 


( SS,UDPGT 


hypothetical protein FLJ20706 


7.5 


453511 


AL031224 


Hs 33102 


SS,SS 


transcription factor AP-2 beta (activati 


7.5 


439809 


R41396 


Hs 101774 


SS 


hypothetical protein FLJ23045 


7.5 


414869 


AA157291 


Hs 21479 


SS 


ubinuclein 1 


7.5 


416276 


U41060 


Hs.79136 


SSJMJM 


LIV-1 protein, estrogen regulated 


7.5 


452862 


AW378065 


Hs 8687 


.SS.Pep M12B propep,Reprolysfn,tsp_1 , 


ESTs 


7.4 


452926 


AI742170 


Hs 31297 


,SS,TM 


duodenal cytochrome b 


7.4 


453331 


AI240665 


Hs8895 


,SSJM,disintegrin P Pep„M1 2B_propep,Repro ESTs 


7.3 


420802 


U22376 


Hs1334 


SS,NA,mybJ)NA-binding 


v-myb avian myeloblastosis viral oncogen 


7.3 


450603 


R43646 


Hs 12422 


SS 


ESTs 


7.2 


422867 


L32137 


Hs 1584 


SS,EGF ( tsp_.3,SS,E2F_TDP, 


cartilage oligomenc matrix protein (COM 


7.2 


418004 


U37519 


Hs.87539 


SS f aWedh,SS,aldedh, 


aldehyde dehydrogenase 3 family, member 


7.2 
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426451 AI908165 
450701 H39960 
419519 AI198719 
410555 U92649 



Hs. 169946 
Hs. 288467 
Hs.176376 
Hs.64311 



10 



15 



20 



25 



30 



35 



40 
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50 



55 



60 



65 



433138 AB029496 Hs.59729 

411558 AA102670 Hs.70725 

409079 W87707 Hs.82065 

417275 X63578 Hs.295449 

432731 R31178 Hs.287820 

442818 AK001741 Hs.8739 
407366 AF026942 

427427 AF077345 Hs.177936 
410785 AW803341 
401045 



C 1 1 001 883*:gi|6753278lreflNP _033938,1 1 c 
418986 AI123555 Hs.81796 ,$S,Reprolysin,tspJ, 
Hs.7413 



SS.GATA, 
,SS,LRR 
SS 

.TM.disinte g rin ,R eprotys in, 
SS.ig ,Sema ,SS ,Sema,efha nd 
SS.TM.SSJM 
,TM,fn3, 

SS.efrand.SS.efhand.ras 
.SS.frtfMfriZfrti.fnl 
WD40,SS 
JMJBR 
SSJectin_c,SS 
SS 

ICE_p2D l SS.ICEj)10 1 ICE_p20,ICE_p10 l ICE_p 

67 



GATA-binding protein 3 (T-cell receptor 
Homo sapiens cDNA FLJ 12280 fe, clone MA 
ESTs 

a distntegrin and metalloproteinase doma 
semaphorin sem2 

gamma^aminobutyric acid (GABA) A recepto 
interieukin 6 signal transducer (gp130, 
paralbumin 
fibronectin 1 

hypothetical protein FU10879 
gb:Homo sapiens cig33 mRNA, partial sequ 
ESTs 

gb:IL2-UM0079-090300-050-D03 UM0079 Homo 



Hs.325823 
Hs.283713 



442082 R41823 
442861 AA243837 Hs.57787 
418836 AI655499 Hs.161712 
422060 R20893 
444381 BE387335 
404091 NA 

417866 AW067903 Hs.82772 
428819 AL135623 Hs.193914 
410275 U85658 Hs.61796 
425236 AW067800 Hs.155223 
415669 NMJW5025HS.78589 
416319 A1815601 Hs.79197 
412140 AA219691 Hs.73625 
442942 AW1 67087 Hs,131562 
446163 AA026880 Hs.25252 
443162 T49951 Hs.9029 
409602 W26713 Hs.256972 
428479 Y00272 
400300 X03363 
433404 T32982 
410079 U94362 
401781 

447359 NM_012093Hs.18268 
402230 NA 

427674 NMJX)3528Hs.2178 
428398 AI249368 Hs.98558 
458098 BE550224 
419968 X04430 
416636 N32536 
419703 AI793257 
424687 J05070 
449679 AI823951 
421296 NM_002666Hs.103253 
442117 AW664964 Hs.1 28899 
400303 AA242758 Hs.79136 
419440 AB020689 Hs.90419 
444858 AI199738 Hs.208275 
432239 X81334 Hs.2936 
440705 AA904244 Hs. 153205 
400286 NA 

446466 H38026 Hs.308 
423201 NM_00O163Hs.125180 
433043 W57554 Hs.125019 
439509 AF086332 Hs.58314 
425247 NM_005940Hs.155324 
409757 NM_001898HS.123114 
425292 NMJ05624HS.155545 
448045 AJ297436 Hs.20166 
452681 AF153330 Hs.30246 
452243 AL355715 Hs.28555 
439310 AF086120 Hs.102793 



Hs. 184572 

Hs.102720 
Hs.58589 



Hs,74170 

Hs.93913 

Hs.42645 

Hs.128151 

Hs.151738 

Hs.129700 



1 TM,EPHJbd,pkinase,SAM 1 fn3, 
SS 

,TM^ctivin_recp,pk]'nase l death,2U5 ) 

,SS,TM,CD36 

,SS,mito_carr 

,TM,7tm_3,ANF_receptof 1 

SS.Collagen.COLFIJSPN.SSJSPN 

SS.SS 

,SS ) Rbosomal_S4e 
SS 

,SS r serpin, 

SS.TM.ig.SSJM 

^SS.kinesin, 

,SS,ig,Serna,pkinase, 

,SS,TM ) fn3 1 

filament.SS.filament, filament 
,SS,TM, DAGKa.DAGKc, 
( SS,pkinase,pkinase 



ESTs 

ESTs; ca!syntenin-2 

ESTs 

ESTs 

ESTs, Moderately similar to ALU5_HUMAN A 
ESTs, Weakly similar to S64054 hypotheti 
Target Exon 

collagen, type XI, alpha 1 
KIAA0575 gene product 
transcription factor AP-2 gamma (activat 
stanniocalcin 2 

serine (or cysteine) proteinase inhibito 
CD83 antigen (activated B lymphocytes, I 
RAB6 interacting, kinesln-like (rabkines 
ESTs 

prolactin receptor 
DKFZP434G032 protein 
ESTs 

cell division cycle 2, G1 to S and G2 to 



l SS,TM,pkinase ) Recep_LJomain,SH2,PH,FurHER2 receptor tyrosine kinase (c-ert>b2 



SS 

Glyco_transf_8,SS 

,SS,filament,Pribosyftran,filamentAmad 

SS.adenytatekinase, 

,SS,TM,p450, 

histone,SS»histone, 

,SSJM 

SS 

SS,lL6,lL6, 
,SS,TM 
,SS,zf-C2H2 



ESTs 
glycogen"*) 2 
Target Exon 
adenylate kinase 5 
Target Exon 

H2B histone family, member Q 
ESTs 

metallothbnein 1 E (functional) 
interieukin 6 (interferon, beta 2) 
solute carrier family 16 (monocarboxylic 
ESTs 



HI, • — 

SS l Peptfdase_M10,fn2,hemopexin l SS JM.Pepmatrix metalloproteinase 9 (oelatrase B 
SS tolloid-like 1 

SS penlipin 
,SS,™ ESTs 
SSJM LIV-1 protein, estrogen regulated 

SS ' K1AA0882 protein 

SS ESTs, Weakly similar to ALU/^HUMAN I! 

SS,Peplidase_M10,hemopexin l SS,PepUdase - matrix metalloproteinase 13 (collagenase 



TM 

SS,TM,ABC_tran^BC_membrane,SS 

arrestin.SS 

SS,TM,fn3,SS 

3S 

.SSJM.Syntaxin 

SS r Peptidase_M10,hemopexin,SS 

,SS,cystatin, 

SS 

,SS,TM 
,SSJM 
SS 

t SS,TM t UDPGT,casein_kappa 



ESTs 

C16000922:gii7499103lpirifT20903 hypothe 

anestin 3, retinal (X-arrestin) 

growth hormone receptor 

lymphoid nuclear protein (LAF-4) mRNA 

ESTs 

matrix metalloproteinase 11 (MMP11; stro 
cystatin SN 

37 kDa leucine-rich repeat (LRR) protein 

prostate stem ceJI antigen 

solute carrier family 19 (thiamine trans 

programmed cell death 9 

ESTs 



7.1 
7.1 
7.1 
7.1 
7.0 
7.0 
7.0 
7.0 
6.9 
6.9 
6.8 
6.8 
67 



6.7 

67 

6.6 

6.6 

6.6 

6.6 

6.6 

6.6 

6.5 

64 

6.2 

6.2 

6.2 

6.2 

6.2 

6.1 

61 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.0 

6.0 

6.0 

6.0 

5.8 

5.8 
5.8 
5.8 
5.7 
5.7 
5.7 
5.7 
5.6 
5.6 
5.6 
5.5 
5.5 
55 
5.4 
5.4 
5.4 
5.4 
5.4 
5.3 
5.3 
5.2 
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441111 


A1806867 


Hs 126594 


( SS,TM,Phosphodiest, 


ESTs 


5.2 


452355 


N54926 


Hs.29202 


TM,7tm 1.TM 


G protein-coupled receptor 34 


5.2 


427711 


M31659 


Hs 180408 


SS 


solute earner family 25 (mitochondrial 


5.2 


418636 


AW749855 




,SS,TM,HECT 


gb:QV4-BT0534-281299-053-c05 BT0534 Homo 


5.2 


429353 


AL117406 


Hs 200102 


,SS,TM,ABC tran.ABC membrane, 


ATP-binding cassette transporter MRP8 


5.1 


441690 


R81733 


Hs 33106 


( SS,HECT,zf-UBR1 r PABP l 14-3-3 ( 


ESTs 


5.1 


430447 


W 17064 


Hs 332848 


SS 


SW1/SNF related, matrix associated, acti 


5.1 


429698 


A1685086 


Hs.26339 


,SS,ras, 


ESTs, Weakty similar to S2 1348 probable 


5.1 


425325 


X52730 


Hs 1892 


SS ( NNMT_PNMT_TEMr,SS t NNMT_PNMT_TEMT.STAR 





423600 
414737 
403593 
407758 
445234 
411165 
420633 
414117 
416783 
401093 
411096 
457411 
436007 
450506 
417975 
421072 
427032 
447752 
403199 
427122 
445900 
413048 
419563 
442432 
452093 
442323 
450606 
435542 
417576 
446089 
445413 
424420 
432378 
452190 
434674 
419986 
421582 
410361 
426327 
406639 
452834 
427315 
446733 
442118 
421524 
453060 
453403 
444301 
453619 
432656 
426384 
431701 
416931 
420854 
418867 
443514 
447499 



phenyiethanolamine N- me thyrtransf erase 



Hs 310359 
Hs 125087 



AI633559 
Al 160386 
NA 

D50915 Hs 38365 
AW137636 Hs 146059 
NM_000169Hs 69089 
NWL014581HS 274480 
W88559 Hs 1787 
AA206186 Hs79889 

U80034 Hs.68583 
AW085961 Hs 130093 
AI247716 Hs 232168 
NM.004460Hs.418 
AA641836 Hs 30085 
AI215069 Hs 89113 
AF012023 Hs 173274 
M73700 Hs 105938 
NA 

AW057736 Hs 323910 
AF070526 Hs 13429 



M93221 
AA526235 
BE093589 
AA447453 
AW016669 Hs 29190 
AI668605 Hs 60380 



Hs 75182 
Hs 193162 
Hs 38178 
Hs 27860 



AA687376 

AA339449 

AI860021 

AA151342 

BE614743 

AI493046 

H26735 

AA831879 

AI345455 

Ar910275 

BE391804 

W03242 

M97711 

AI638627 



Hs 269533 
Hs 82285 
Hs 270651 
Hs 12677 
Hs.146688 
Hs 146133 
Hs 91668 
Hs 136985 
Hs.78915 
Hs1406 
Hs 62661 
Hs 44898 

Hs 105685 



AA179949 Hs.175563 
AA863360 Hs.26040 
AA976718 Hs.202242 
AA312082 Hs.105445 
AW294092 Hs.21594 
BE466639 Hs.61779 
AK000136 Hs.10760 
H87648 Hs.33922 
NM 000246HS.3076 
AI472078 Hs 303662 
AW935490 Hs. 14658 
D45371 Hs.80485 
AW296927 
D31771 Hs.89404 
BE464288 Hs.141937 
AW262580 Hs.147674 



SS 
SS 

,CIDE-N,pkinase 

SS,SS 

,SS,TM 



5.1 

ESTs 

ESTs 

Target Exon 
KIAA0125 gene product 
ESTs 



SS,^Sbtase,BTK^H,pkinase,SH2,SH3,Ribo galactosidase, alpha 
SS,lipocalin,SS,lipocalin odorarrt-binding protein 2A 

JM.ionJrans.KJetra, proteolipid protein 1 (Pelizaeus-Merzbac 

SS,TM,TM monocyte to macrophage differentiation -a 

TM,LRRCT,TM,LRRCT, C12000586*:gi[6330167|dbj|BAA86477.1} (A 

Peptidase_M3, mitochondrial intermediate peptidase 

SS ESTs 
,SS,adh_zinc, ESTs 

SS,DPPlV_NJerm,Peptidase_S9,SS,DPPIV„N_fibroblast activation protein, alpha 



.SS.trypsin 
SS 

,SS, 14-3-3 

SS,transferrin,7tm_1 .transferrin, 
SS,TM r Folate_camer,SSJM,Folate_came 



hypothetical protein FLJ23186 
ESTs 

integrin cytoplasmic do main -associated p 
lactotransferrin 

NMJ)25243*:Homo sapiens solute carrier f 



,SS,TM,pkinase 1 Recep_L_domain,SH2,PH l FurHER2 receptor tyrosine kinase (c-err>b2, 



.SS.Ca.channeLB, 
SS,TM,lectin_c,fn2,Ricin_BJectin,SS,TM 
SS 
SS 

,SS,TM,7tm_1, 
,SS,TM,CBS,voltage_CLC 
,TM,Glyco_hydroJ 
.SS.pkinase.RhoGEFjg.PH.SM, 



Homo sapiens clone 24787 mRNA sequence 

mannose receptor, C type 1 

Homo sapiens cDNA FU 11983 fis, clone HE 

hypothetical protein FU23468 

Homo sapiens mRNA; cDNA DKFZp586M0723 (f 

ESTs 

ESTs, Moderately similar to ALU6_HUMAN A 
ESTs 



AlRS,formyLtransf ) GARS,SS,GARS,AIRS ( fc*phospto 



.pkinase 
SS t UPF0099,SS,UPF0099 1 
,SS,TM,MAPEG, 
,SS,TM,UDPGT 

,SS,TM,PH,SH2,Furin-like, pkinase ,Recep_L 

.SS.Htetdeacetyl, 

pkinase.OPR, 

SS.trefoil.SS.TM.ldLrecepLa.SRCR.tryps 

SS,TM,GBP,TM,GBP 

SS 

SS.SS/ig, 

,SS,DEAD,ForkJiead 
SS 

,SS,TM,p450, 

jg.Sema, 

SS 

,SS,ras,Y__phosphatase,ras 

,SS,HMG„box,fiIament 

SS.LRR.SS 

SS 

SS,LRR, 

.SS.ArfGap, 

.SS.BIR 

SS,C1q,Collagen,SS,C1q, 
.SSJM.Peptidase.MI, 
SS.homeobox.homeobox, 
.SS.TM.MIP, 

.SSJM.cadherin.cadherin 
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ESTs, Moderately similar to A47582 B-cel 
CGI-147 protein 
prostaglandin E synthase 
ESTs 

Homo sapiens clone PP1498 unknown mRNA 
ESTs 

GA-binding protein transcription factor, 

trefoil factor 1 (pS2) 

guanylate binding protein 1 , interferon- 

Homo sapiens clone TCCCTA00151 mRNAsequ 

gb: Human T-ceil receptor (V beta 18. 1 , J 

KIAA1688 protein 

Homo sapiens mRNA; cDNA DKFZp564N0763 (f 
ESTs, Weakly similar to fatty acid omega 
ESTs 

GDNF family receptor alpha 1 
hypothetical protein MGC 15754 
Homo sapiens cDNA FLJ13591 fis, clone PL 
asporin (LRR class 1) 

Homo sapiens, clone MGC:9084, mRNA, comp 

MHC class i) transacth/ator 

ESTs 

Human chromosome 5q13.1 clone 5G8 mRNA 
adipose most abundant gene transcript 1 
gb:U!-H-BW0-ajc-c-07^Ul.s1 NCl_CGAP_Su 
msh (Drosophfla) homeo box homolog 2 
ESTs 

protocadherin beta 16 



5.1 
5.1 
51 
50 
50 
49 
49 
49 
49 
49 
49 
49 
49 
49 
49 
48 
48 
48 
48 
48 
47 
47 
47 
46 
46 
46 
46 
46 
4.6 
46 
46 
45 
45 
45 
45 
45 
4.5 
45 
45 
45 
45 
44 
44 
4.4 
4.4 
4.4 
44 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
43 
4.3 



WO 02/059377 
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441560 
409064 
422667 
454032 
432663 
401747 
432882 
437036 
447754 
443194 
451871 
457211 
421566 
431657 
427899 
444779 
442295 
436396 
446039 
422938 
406922 
439285 
424800 
429922 
447178 
409038 
452747 
420139 
408877 
403329 
439926 
430832 
432461 
452410 
418661 
431958 
425071 
447197 
428722 
428330 
423242 
449048 
414831 
413589 
408875 
418629 
450787 
414B70 
450325 
407633 
426172 
442262 
427961 
445563 
403943 
408761 
423279 
420440 
445107 
428303 
411667 
427809 
418203 
430376 
444190 
433495 
429638 



F13386 Hs.7888 
AA062954 Hs.141883 



H25642 
W31790 
AI984317 



Hs.1 33471 
Hs.1 94293 
Hs.122589 



NM_013257Hs279696 
AI571514 Hs.133022 
AW073310 Hs.163533 
AI954968 Hs.279009 
AI821O05 Hs.1 18599 
AW972565 Hs.32399 
NM_0O0399Hs.1395 
AI345227 Hs.105448 
AA829286 Hs.332053 
AI192105 Hs.147170 
A1827248 Hs.224398 
AI683487 Hs.152213 
AI150491 Hs.90756 
NM_0018O9Hs.1594 
S70284 

AL133916 Hs.172572 
AL035588 Hs.153203 
Z97630 Hs.226117 
AW594641 Hs.1 92417 
T97490 Hs.50002 
BE1 53855 Hs.61460 
NM_005357Hs.95351 
AA479033 Hs.130315 
NA 

AW014875 Hs.137007 
AI073913 Hs.100686 
AW451645 Hs.151504 
AL1 33619 Hs.29383 
NM001949HS.1189 
X63629 Hs.2677 
NM_013989Hs.154424 
R36075 



.pkinase, 

,SS,CUB, 

,SS,TM,FMO-like 

f SS,TM 

TM 

,SS,filament, filament 
pkinase,pkirtase_C, 
,SS,TM 
.pkinase, 
,SS,TM 
.SS.GDNF, 
WH1.WH1 
zf-C2H2,SS 
,SS,TM,pkinase 



U76456 
L22524 
AL039402 
Z45051 
M31158 



Hs.190787 
Hs.2256 
Hs.1 25783 
Hs.22920 
Hs.77439 



AW452631 Hs.313803 
NM_015434Hs.48604 
BE247550 Hs.86859 
AB006190 Hs.25475 
N72264 Hs.300670 
AI935962 Hs.26289 
NM_007069Hs.37189 
AA371307 Hs.125056 
BE170651 Hs.8700 
AW293165 Hs.143134 
AW873606 Hs.149006 

AA057264 Hs338936 
AW959861 Hs.290943 
NM_002407Hs.97644 
AI208121 Hs.147313 
AW974476 Hs.1 83601 
BE160198 

M26380 Hs.180878 
X54942 Hs.83758 
AW292053 Hs.12532 
AI878918 Hs.1 0526 
AW373784 Hs.71 
AI916662 Hs.211577 



( Ss!sAA_proteins f ABC„membrane,ABC„tran 1 serum amyloid A1 



Homo sapiens clone 23736 mRNA sequence 

ESTs 

ESTs 

ESTs, Weakly simitar to 1 54374 gene NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
serum/glucocorticoid regulated kinase -li 
ESTs 

Homo sapiens cDNA FLJ14142 fis, clone MA 

matrix Gla protein 

ESTs 

ESTs, Weakly similar to S51797 vasodilat 
early growth response 2 (Krox-20 (Drosop 
ESTs, Weakly similar to B34087 hypotheti 



SS 

.COLFI.vwcCoHagen, 
,wnt, 

JM.GIycoJiydroJ 
,SS,TM,thiolase, 
SS.TM.Desaturase.SS 
,SS,ig 1 pkinase,LRRM',lRRCT 
HLH.SS 

.SS.TMJinkerJiistoneJtmJ 
,SS,TM 
SS,IL8,SS,IL8 
.SS.HLH 
.SS.TM.p^SO, 
.SS/IM 
SS.SS 
SS 
SS 

,SS,Collagen,COLFI l TSPN, 
.SS.TM.ras 
SS 



ESTs 



Homo sapiens cDNA FU11469 fis, clone HE 
wirgtess-type MMTV integration site fami 
ESTs 

centromere protein A (17kD) 
gb:stearoyl-CoA desaturase [human, adipo 
hypothetical protein FLJ20093 
MyoD family inhibitor 
H1 histone family, member 0 
ESTs 

small inducible cytokine subfamily A (Cy 
Ig supexfamiry receptor LNIR 
lipase, hofrnone-sensttive 
ESTs, Weakly similar to A47582 B-cell gr 
Target Exon 
ESTs 

ESTs, Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FU11973 fis, clone HE 
Homo sapiens mRNA; cDNA DKFZp434E2321 (f 
00 E2F transcription factor 3 

SS,TM,Cadherin_CJerm ) cadheriri 1 SS,TM,cadcadherin 3, type 1, P-cadherin (placenta 
SS T4_dek>dinase,T4 Jeiodinase, deiodinase, iodothyronine, type II 

TM.SDF, gb:yh88b01.s1 Soares placenta Nb2HP Homo 

isS.TIMP, tissue inhibitor of metatloproteinase 4 

SS.PeptklWMIO.SSPeptida^MIO.r^pematrix metalloproteinase 7 (MMP7; uterin 
Ss' DEME-6 protein 

SS SS,TM similar to S68401 (cattle) glucose indue 

SS,cNMP_binding,Rlla I HMG_box protein kinase, cAMP^iependent, regulato 

SS abhydrolase ESTs, Highly similar to AF1 57833 1 noncl 

Ss" DKFZP434B1 68 protein 

SS p SH2 t PH 1 SS t TM > PH ( SH2,Furirvlike ) pkinas growth factor receptor-bound protein 7 



SS,TM,MIP,SS,TM,MIP, 
SS 
SS 

TM.TM 
.SS.DENN 
.SS.START, 
SS 

,SS,WH1,WH1 
p450,SS,p450 
.SS.TrVUtmJ, 
SS 

.SS.SRCR.Uterogbbin 
,SS,TM 

SS,RGS,RGS,RGS 
TM 

,SS l lipase,PUT,Sec7,PH, 

CKS.SS.CKS, 

SS 

SS 

SS.lg,MHCJ,oonnexin,SCAN f SS,TM 
SS.TM.SS 



aquaporln 7 
WAA1204 protein 
ESTs 

similar to rat HREV107 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C500O355:gi|45O3225lreflNP_000765.1l cyt 
ESTs, Weakfy similar to (deffine not ava 
ESTs 

mammaglobin 2 (MGB2; mammagtobin B; lip 
ESTs, Weakly similar to I38022 hypotheti 
regulator of G-protein signaling 16 
g b:QV1-HT0413-010200-059-h03 HT041 3 Homo 
lipoprotein lipase 
CDC28 protein kinase 2 
chromosome 1 open reading frame 21 
cysteine and glycine- rich protein 2 
alpha-2-grycoprotein 1,anc 
kinectin 1 (kinesln receptor) 



4.3 
4.3 
4.3 

4.3 
4.3 

4.3 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

40 

39 

3.9 

39 

3.9 
3.9 
3.8 
3.8 
3.8 
3.8 
3.B 
3.8 
38 
38 
3.8 
3,8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
37 
37 
37 
3.7 
37 
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454071 
451859 
420281 
427691 
428824 
424676 
418026 
457465 
417601 
407999 
425548 
446619 
411213 
406625 
417511 
428769 
407137 
401866 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 
453968 
407104 
449051 
434398 
454042 
459496 
414998 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 
406925 
445873 
418054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
418334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



AI041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs.191533 
AW1 94426 Hs.20726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.83213 
AW301344 Hs.122908 
NM_014735Hs 82292 
AI126271 Hs.49433 
AA890023 Hs1906 
AU076643 Hs.313 
AA676939 Hs 69285 
Y13647 Hs 119597 
AL049176 Hs.82223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA502490 Hs.336695 
NMJ00909Hs 169266 
AW089705 Hs 293711 
AA284775 Hs 43148 
Y16645 Hs 271387 
NM_0144O0Hs 11950 
AI791493 Hs 129873 



AI308876 
AF245505 
AI417828 
AA847843 
S57296 



Hs.103849 
Hs 72157 
Hs.1 92435 
Hs 62711 
Hs.323910 



AW961400 Hs 333526 
AA121098 Hs.3838 
H22570 Hs 172572 
AA808940 Hs 274450 
NM_002543Hs.77729 
AA1 16021 Hs 38260 
Af651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs 58006 
NM„014918Hs 110488 
AK001423 HS 94694 
U25128 Hs 159499 
AA426202 Hs.4O403 
L34041 Hs9739 
AA250970 Hs.251946 
NM_00231BHs .83354 
BE390551 Hs.77628 
NM_003512Hs 28777 
NM_000346Hs2316 
AA442324 Hs.795 
M25809 Hs 64173 

NM_004585Hs 17466 
AA319233 Hs.5521 
C18356 Hs 295944 
AF1 82277 Hs 330780 
AA825686 Hs 321176 
Y15221 Hs 103982 
NM_005419Hs 72988 
AW968504 Hs.123073 

BE274552 Hs 76578 
AW797437 Hs69771 
H73505 Hs 117874 
D86961 Hs.79299 
NMJ06456HS 288215 



JM,7tm_1. 

,SS,TM,EGF,ldLrecepLa,kiLrecepLb,EGF 
,SSMA, 

,SS,GlycosJransf_2, 
SS 

Glycos_transf_2,Ridn_BJeclin,SS,Glyoos 

lipocalai ,SS .fipocafin .lipocalin.ferriti 

,SS,Pribosyltran,Sulfatase 

PHD ( pkinase,SS 

SS 

SS,TM t fn3,SS,TM,fn3, 
,SS,TM,efhand,ionJrans 



SS,TM ) CUB l F5_F8„type„C ) MAM,SS,TM ( CUB,F5_neuropllin1 



ESTs 

ESTs, Weakly similar to ALULHUMAN ALU S 

ESTs 

ESTs 

ESTs 

UDP-N-acetyl-alpha-D-galactosamine:polyp 
fatty acid binding protein 4, adipocyte 
DNA replication factor 
KIAA0215 gene product 

ESTs, Weakly similar to YZ28_HUMAN HYPOT 
prolactin receptor 

secreted phosphoprotein 1 (osteopontin, 



37 
37 
37 
37 
37 



SS,TM,Desaturase,SS 
SS 

,SS,7tm_1,SPRY, 

,SS,TM,GDA1_CD39 

.SS.filament. 

SS.homeobox ,Ets,SS .homeobox , 
SS 

SS,TM,7tmJ ( 
SS 

,SS,TM,PMP22_Claudin, 
,SS,TM,IL8 
,SS,PH, lactamase^ 
,SS,p450,p450 



stearoyl-CoA desaturase (delta-9-desatur 
chordtn-like 
ESTs 

gb:ye53h05.s1 Soares fetal liver spleen 
Target Exon 

mesenchyme homeo box 1 
ESTs 

neuropeptide Y receptor Y1 

ESTs, Weakly similar to S64329 probable 

ESTs 

small inducible cytokine subfamily A (Cy 
GPI -anchored metastasis-associated prote 
ESTs, Weakly similar to A36036 cytochrom 



,TM,hemopexin,Peptidase_M10,hemopexin,Pehypothetical protein DKFZp761D1 12 
ig.LRRCT, DKFZP564I1922 protein 

.SS.TM ESTs 

,SS,HMG„box, Homo sapiens, done 1MAGE:3351295, mRNA 

,SS,TM,SH2,PH,pkinase,Recep_L_domain,Furv-erb-b2 avian erythroblastic leukemia v 



HER2 receptor tyrosine kinase (c-erb-b2, 
serum-inducible kinase 
hypothetic^ protein FU20O93 
EST 

oxidised low density lipoprotein (lectin 
ubiquitin specific protease 18 
ESTs 
cyclin B1 

ESTs, Weakly similar to ALU5JHUMAN ALU S 
KIAA0990 protein 

Homo sapiens cDNA FU10561 fis, ctone NT 
parathyroid hormone receptor 2 



SS 

pkinase.POLO^box.SS.pkinase.POLO.box, 
.SS.rg.pkinase.LRRNTT.LRRCT, 
.SS,TM,KRAB 1 SCAN 1 zf-C2H2jg 
,SS,TM 

SS l UCH-1,UCH-2,SS,TM,G_glu_transpept 
SS 

cydin.SS.TM, cyclin, 
SS 
SS 
SS 

SS,TM,7tmJ,SS,TM,7tm_2 

JM,ABC_membrarie > ABC_tran,Ribc«omaLS4eCbp/p300-interacting iransactivator, wit 
,SS l TM,transporLprot,SWIB,RhoGAP,DAG ^Eglycerol^-phosphate dehydrogenase 1 (so 
^S.rrm.PABP.pkinase.M-S^.rrm poiy(A)-blnding protein, cytoplasmic 1-1 

,SS,TM,mito_carr,LysyLoxidase tysyl oxidase-like 2 

SS,START,SS,START,NNMT_PNMr„TErvrr, steroidogenic acute regulatory protein r 
SS,histone,Calc„CGRPJAPP,ig,MHCJ,SPRY,H2A histone family, member L 



SS,HMG_box, 
histone,SS,histone,BolA 
ATP-synLab,SS,7tm_1,ATP-synt_ab 
filamenLS S,filament,fi lament 
TM 

,SS,TM,RibosomaLL27e, 

,Kun(tz_BPTI, 

SS,p450,SS 

SS 

SS.IL8, 

SH2,STAT,SS,STAT 
.pkinase, 

RhoGEF,PH t SS,RhoGEF,PH, 

SAP,SS,FG-GAP,vwa 

SS^ushi.trypsin.vwa^.fibrinogen^C.fn 

,SS,Peptidase_S8 1 P,PeptIdase_S8,P 

,SS,TM 

.SS.Pribosyltran, 



SRY (sex determining region Y)-box 9 (ca 
H2A histone family, member O 
ATPase, H transporting, lysosomal (vacuo 
NM_005557*:Homo sapiens keratin 16 (foca 
retinoic acid receptor res ponder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytxhrome P450, subfamfly IIB (phenobar 
ESTs, Weakly similar to S65824 reverse t 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans 
CDC2-related protein kinase 7 
NrvL00337r:Homo sapiens vav 2 oncogene ( 
protein inhibitor of activated STAT3 
B-factor, properdin 
ESTs 

lipoma HMGIC fusion partner-like 2 
sialyllransferase 



37 
37 
37 
37 
37 
36 
36 
3.6 
36 
36 
36 
36 
36 
36 
36 
36 
36 
36 
3.6 
36 
36 
35 
35 
35 
35 
35 
3 5 
35 
3.5 
3.5 
35 
3.5 
35 
34 
3.4 
3.4 
34 
3.4 
3.4 
3.4 
3.4 
34 
3.4 
3.4 
3.4 
34 
34 
3.4 
3.4 
34 
34 
34 
3.4 
34 
34 
34 
33 
3.3 
3.3 
3.3 
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Hs.339665 
Hs.172914 



Hs.172636 
Hs, 164680 
Hs.280380 
Hs.134981 
Hs.159412 



Hs.57664 . 
Hs, 133483 



445462 AA378776 Hs.288649 

439452 AA918317 Hs.57987 

452017 AF109302 Hs.27495 

409099 AK000725 Hs.50579 
5 452106 AI141031 Hs.21342 

447519 U46258 

426928 AFO37062 

438825 BE327427 Hs.79953 

414575 H11257 Hs.22968 
10 417837 AL079905 Hs.1103 

422128 AW881145 

445941 AI267371 

429973 AI423317 

444542 AI161293 
15 459561 AI547306 

425741 AF052152 

426501 AW043782 Hs.293616 

456508 AA502764 Hs. 123469 

434228 Z42047 Hs.283978 
20 415752 BE314524 Hs.78776 

400419 AF084545 

439750 AL359053 

423858 AL137326 

428514 AW236861 Hs.193139 
25 428698 AA852773 Hs. 334 838 
448988 Y09763 Hs.22785 
432072 N62937 Hs.269109 
417433 BE270266 Hs.82128 
452194 AI694413 Hs.332649 
30 444051 N48373 Hs 10247 
420042 AW015140 Hs 161723 
457292 AI921270 Hs.334882 
421458 NM_003654Hs.104576 
431104 AW970859 Hs.313503 
35 443767 BE562136 Hs.9736 
419589 AW973708 Hs.201925 
415447 297171 Hs.78454 
443464 BE548446 Hs.5167 
423431 AA326062 
40 413278 BE563085 Hs.833 

458451 AW297181 Hs.195922 
440449 AA885430 Hs.201925 
413753 U17760 Hs.75517 
434876 AF160477 Hs.61460 
45 435575 AF213457 Hs.44234 
415773 R21651 Hs.324725 
446440 AV658411 Hs.42656 
450847 NM 003155HS.25590 
426075 AW513691 Hs.270149 
50 452110 T47667 Hs.28005 
439963 AW247529 Hs.6793 
402837 NA 

439451 AF086270 Hs.278554 
406664 L34041 Hs,9739 

55 417315 AI080042 Hs.336901 
413011 AW068115 Hs.821 
414987 AA524394 Hs„294022 
429197 H24471 Hs.26930 
448030 N30714 Hs.325960 

60 407604 AW191962 Hs.249239 
419092 J05581 Hs.89603 
456672 AK002016 Hs.114727 
443171 BE281128 Hs.9030 
452256 AK000933 

65 432201 AI538613 
406642 AJ245210 
400903 NA 



ss.ss 

ss.ss 

ss 

ss 

ss 

ss 

^S.adh.shortjGF-betaJGFbjropeptkte 
,SS,TM p histone,ANF_fieceptDr,guanyiate_cy 
,SS,pkinase,ig ( 

SS.TGFb jxopeptide.TGF-beta.SS 
SS 

SS.SSJipoxygenase.PLAT 
,SS,T-box,UDPGT 

S S.SS, Peptidase_M 1 ,EGF ,ig Jectin_c sushi 
SS 

,pkinase, 

SS 

SS 

.SS.TIvUtmJ 
TM 

,SS,Peptidase_M1, 

,T M , In teg riruB,Rici nJJJedin ,rrm 

,SS,TM 

,SS,START,NNMT_PNMT_TEMT, 
SS 

SSJM.TM 
.Sema.ig, 

SS,TM,LRRCT,LRRNT,LRR 1 TM,LRRCT, 

,SSJM,7tmJ,ANF_receptor,sushi 

( SS,ig, 

,SS,CUB, 

SS.TM.SS.TM.G-patch 
SS 

.Sema.fg, 

,SSPCI,RasGEF .hormone jec,zf-C4, 
,FGF, 

SS 1 OLF,OLF 1 OLF,Ribo9omaLL4 

SS.TM.SSF.SS.TM 

,SS,p450,p450 

.SS.TM.ubiquitinJamintn.GJaminin.EGF.k 

,SS,Ribosomal_Ll4 

,FGF, 

SS,Iaminin_EGFJamlnin_Nterm^dh_short,S 

,SS,HLH 

SS,ig,SS,TM 

1 SSJM l RtoosomaLS3Ae,G^mrna 
SS 

.SS.homeobox, 
,SS,fn3, 

,SS,TM^cfivin_recp t pkinase 

JM,p450,Ets 

SS 

.SS.Chromo^shadow.ch romo, 



hypothetical protein MGC3077 
B-cett CLLftymphoma 1 1 B (zinc finger pro 
prostate cancer associated protein 7 
hypothetical protein FU20718 
ESTs 
ESTs 

retino! dehydrogenase 5 (11-cts and 9-ci 
ESTs 

Homo sapiens clone IMAGE:451939, mRNA se 
transforming growth factor, beta 1 
gb:QV0-OT0033-010400-182-a07 OT0033 Homo 
ESTs 
ESTs 

amino peptidase 
ESTs 

Homo sapiens clone 24628 mRNA sequence 
ESTs 

ESTs, Weakhy similar to AF208855 1 BM-01 
Homo sapiens PR02751 mRNA, complete cds 
putative transmembrane protein 
Target 

Homo sapiens mRNA full length insert cDN 
Homo sapiens mRNA; cDNA DKFZp434B0650 (f 
ESTs 

KIAA1866 protein 

gamma-aminobutyric acid (GABA) A recepto 
ESTs 

5T4 oncofetal trophoblast glycoprotein 
olfactory receptor, family 2, subfamily 
activated leucocyte cell adhesion molecu 
ESTs 

hypothetical protein FU14251 
carbohydrate (keratan sulfate Gal-6) sul 
ESTs 

proteasome (prosome, macropaln) 26S subu 
Homo sapiens cDNA FU 13446 fts, clone PL 
myodlin, trabecular meshwork inducible 
Homo sapiens mRNA; cDNA DKFZp434F152(fr 
gb:EST29171 Cerebellum It Homo sapiens c 
interferon-stimulated protein, 15 kDa 
ESTs 

Homo sapiens cDNA FU13446 fis, clone PL 
laminin, beta 3 (nlceln (125kD), kalinin 
|g superfamily receptor LNIR 
triggering receptor expressed on myeloid 
ESTs, Moderately similar to A47582 B-cel 
KIAA1681 protein 
stanniocalcin 1 

ESTs, Weakly similar to 2109260A B cell 
Homo sapiens cDNA FLJ11309 fis, clone PL 
platelet-activating factor acetylhydrola 
ENSP00000241312*;DJ947L8.1.8 (novel Sush 
heterochromatin-like protein 1 



!SS JM t transport^t,SWIB t RhoGAP,DAG_PEgrycerol-3-phosphate dehydrogenase 1 {so 
,Ss!RNAj»LA,RNAj)olJ^ p RibosomaLS24e 1 ribosomal protein S24 



Hs.28661 
Hs.298241 



SS,LRR,LRRNT,SS,LRRNT,LRR, 
,S S,connexin,hormone_rec,zf-C4 1 connexjn 
.SS.Gelsolin, 
,SS,TM 
,SS,TM,C1q, 
SS,TM t SEA, 
.SS.PK.PK 
SS,TR7tm_1,iTrn,SS 
JM.GDIJtmJ, 

SS,TM,trypsin,SS,TM,trefoil 1 trypsin,tref 

SS 

SS 
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biglycan 
hypothetical protein FLJ 14950 
ESTs, Weakly similar to T20272 hypotheti 
membrane-spanning 4-domains, subfamily A 
collagen, type Vtll, alpha 2 
mucin 1, transmembrane 
Homo sapiens, clone MGC: 16327, mRNA, com 
TONDU 

Homo sapiens cDNA FU10071 fts, clone HE 
Transmembrane protease, serine 3 
gb:Homo sapiens mRNA for immunoglobulin 
Target Exon 



3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

33 

3,3 

33 

33 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3,3 

3.3 

3.3 

3.2 

3.2 
3.2 

3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
32 
3.2 
32 
32 
32 
32 
32 
32 
32 
3.2 
32 
32 
32 
3.2 
32 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
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434408 AI031771 Hs.132586 ,SS,G!ycoJiydro_2 ESTs 3.1 

452994 AW962597 Hs.31 305 SS,WD40,SS,WD40, KIAA 1547 protein 31 

445903 AI347487 Hs.132781 fn3,SS,TM,EF1BD ciass I cytokine receptor 31 

424364 AW383226 Hs.201189 SS ESTs, Weakly similar to G01763 atrophin- 3.1 

5 410196 AI936442 Hs.59838 UBACT_repeat,SS,UBACT_repeatThiFJamty hypothetical protein FU1 0608 31 

419150 T29618 Hs.89640 ,TM,pkinase,fn3, TEK tyrosine kinase, endothelial {venous 3.1 

433417 AA587773 Hs.8859 ,SS,SRCR, Homo sapiens, Similar to RIKEN cDNA 5830 3 1 

418624 AI734080 Hs.104211 .Semajg, ESTs 3.1 

436291 BE568452 Hs 5101 ,SS,abhydrolase, protein regulator of cytokinesis 1 31 

10 411000 N40449 Hs 201619 SS ESTs, Weakly similar to S3 8383 SEB4B pro 31 

412519 AA196241 Hs. 73980 SSJroponin,Hemagglutinin,SSJ^ 31 

450223 AA418204 Hs 241493 ,SS,proJsornerase, natural killer-tumor recognition sequenc 3.1 

422790 AA809875 Hs.25933 JM,histone,Sec1,histone^ugar_tr ESTs 3.1 

424269 AW137691 Hs 199754 ,SS,TM,7tmJ,GPS ESTs 3.1 

15 435854 AJ278120 Hs.4996 .SS.WD40 putative an kyrin -repeat containing prote 3.1 

447388 AW630534 Hs 76277 1 SS,TM,rrm,oxictored_q6,oxkJored_q6 Homo sapiens, clone MGC:9381, mRNA, comp 31 

451631 R00866 SS gb:ye79c02.s1 Soares fetal Rver spleen 3.0 

448105 AW591 433 Hs 298241 ,SS,TM,trefoil,trypsin,trefoil Transmembrane protease, serine 3 3.0 

438637 BE500941 Hs 126730 JM,PH, ESTs, Weakly similar to KIAA1214 protein 3.0 

20 423024 AA593731 Hs 325823 ,SS,TM,CD36 ESTs, Moderately similar to ALU5JHUMAN A 3.0 

456592 R91600 ) SS,Ran_BP1,LlM,Ran_BP1 ( GRIP,TPR,LIM gb:yq10c02.r1 Soa res fetal liver spleen 3 0 

425920 AL049977 Hs 162209 SS,TM,SS,TM,PMP22„Claudin ( PMP22_Claudinclaudin 8 3 0 

444670 H58373 Hs.332938 ,SS,TM hypothetical protein MGC 5370 3 0 

401197 art.arf, ENSP00000229263*:HSPC213. 30 

25 437755 AW204256 Hs 291887 ,wnt r ESTs 3 0 

452560 BE077084 Hs.336432 r SS,rrm,2f-RanBP l pkinase,C2,pkinase_C,DA ESTs 30 

410274 AA381807 Hs 61762 SS,SS hypoxia-inducible protein 2 3.0 

450098 W27249 Hs8109 SS hypothetical protein FLJ2 1080 3 0 

404826 ,SS,TM Target Exon 3 0 

30 458389 H70284 Hs.160152 ,SS,RA ESTs, Weakly similar to FPHU alpha-fetop 3 0 

408196 AL034548 Hs 43627 HMGJ>ox ( pkinase,2f-CCHC,SSJM t HMG J>ox,SRY (sex determining region Y)-box 22 3 0 

433675 AW977653 Hs 75319 ( SS,ribonuc_red_sm, ribonucleotide reductase M2 polypeptide 3 0 

418848 AI820961 Hs. 193465 l death,ZU5,pkinaseActivin_recp ) ESTs 3 0 

422095 AI868872 Hs.282804 SS,CuK>xidase,SS,Cu<>xidase,Cu-oxidase hypothetical protein FLJ 22704 3.0 

35 415992 C05837 Hs.145807 ,SS,TM hypothetical protein FLJ 13593 3 0 

424631 AA688021 Hs 179808 SS ESTs 3 0 

409956 AW103364 Hs.727 SS.TGF-beta.TGFbjjropeptide^.TGF-beta.inhibin, beta A (activin A, activin AB a 3 0 

419667 AUQ77005 Hs 92208 SS ( disintegrin,Reprolysin t Pep_Ml2Bj3rope a disintegrin and metalloproteinase doma 3 0 

450946 AA374569 Hs 127698 SS ESTs, Moderately similar to 21O9260A B c 3 0 

40 447770 AB032417 Hs.19545 Frizzted,Fz,SS,THFrizzled,Fz, frizzled (Drosophila) homolog 4 3 0 

423826 U20325 Hs 1707 SS.SS cocaine- and amphetamine-reguiated trans 3 0 

418838 AW385224 Hs 35198 ,SS,TM,Phosphodiest, ecto nucleotide pyrophosphatase/phosphodi 3 0 

442804 AW300118 Hs 131257 ,SS t TM,G-gamma ESTs 3 0 

432284 AA532807 Hs 105822 ,SS,TM,pkirtase, ESTs 3.0 
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TABLE 20A 



10 



15 



20 



25 



30 



Table 20A shows the accession numbers for those pkeys lacking unigenelD's for Table 20. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession; Genbank accession numbers 



Pkey 

410735 
411667 
418636 
420854 
422128 
423431 
423945 
426878 
433687 
447197 
451631 
456207 
456592 



CAT number Accessions 

1221055 1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

1253334.1 BE160198 AW935898 T11520 AW935930 AW856073 AW861034 

177402 1 AW749855AA225995AW750208AW750206 

197072J AW296927 AI684514 AI263168 AA281079 

211994.1 AW881145AA490718 M85637 AA304575 T06067 AA331991 

228162 1 AA326062AA325758AW962182 

233566 J AA4 1 0943 AW948953 AA334202 AA332882 

273265 1 BE069341 AW748403 AL044891 A1908240 AA393080 

373061 J AA743991 AA604852 AW272737 

71 1623 J R36075 AI366546 R36167 

878098_1 R00866 R01523 AI806815 

165078_-1 M193450 

202684 1 R91600T87079AA291455 
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TABLE 20B 



Table 20B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref; Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al.* refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: fndicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand Nt_position 



400608 9887566 Minus 96756-97558 

400903 2911732 Pius 59112-59228 

401045 8117619 Plus 90044-90184,91111-91345 

401093 8516137 Minus 22335-23166 

401197 9719705 Plus 176341-176452 

401747 9789672 Minus 118596-118816,119119-119244^19609-^ 
1 31 932, 1 32451 - 1 32575,1 33580-1 340 1 1 

401780 7249190 Minus 28397-286 17,28920-29045 ,29 135-29296,29411 -295 67, 29705-29787, 30224-30573 

401781 7249190 Minus 83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814 
401866 8018106 Plus 73126-73623 

402230 9966312 Minus 29782-29932 

402408 9796239 Minus 110326-110491 

402578 9884928 Plus 66350-66496 

402606 9909429 Minus 81747-82094 

402837 9369121 Minus 2013-2186,9570-9758,11136-11309,19429-19677,21210-21455,23368-23562,24342-24527,29132-29320 

403199 9958183 Minus 58895-59036,66618-66789 

403329 8516120 Plus 96450-96598 

403593 6862650 Minus 62554-62712,69449-69602 

403943 7711864 Plus 100742-100904,101322-101503 

404091 7684554 Minus 82121-83229 

404347 9838195 P!us 74493-74829 

404826 6572184 Plus 47726^8046 

405366 2182280 Plus 22478-22632 

405654 4895155 Minus 53624-53759 
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TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
5 LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 

AMENABLE TO MODULATION BY SMALL MOLECULES 

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 

1 0 molecules. These were selected as for Table 1 9, except that the ratio was greater than or equal 
to 3.0, the "average" normal adult tissue level was set to the 85 th percentile value amongst 
144 non-malignant tissues, and the 96 th percentile value amongst 73 breast cancers was 
greater than or equal 80 units, and the predicted protein contained a structural domain that is 
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, 

1 5 peptidase, phosphatase, ATPase, or ionjransporter domains). The predicted protein domains 
are noted. 



20 



25 



30 



35 



40 



45 



50 



55 



Pkey: Unique Eos probeset identifier number 

ExAccn; Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 ; Ratio of 93rd percentile tumor to 85*> percentile of normal body tissue 



Pkey 


ExAccn 


UnigenelD 


449746 


AI668594 


Hs.1 76588 


400292 


AA250737 


Hs.72472 


424735 


U31875 


Hs.272499 


407178 AA195651 


Hs.104106 


408045 


AW138959 Hs.245123 


450375 


AA009647 


Hs.8850 


429170 NM_001394Hs^359 


445730 


AI624342 


Hs.1 70042 


424634 NM_003613Hs,151407 


420757 


X78592 


Hs.99915 


424399 


A1905687 


Hs.2533 


447350 


AI375572 


Hs.172634 


456207 


AA193450 




456938 


X52509 


Hs.161640 


402578 






425692 


D90041 


Hs.1 55956 


424001 


W67883 


Hs, 137476 


418007 


M13509 


Hs.83169 


421727 


Y13153 


Hs.1 07318 


411869 


W20027 


Hs.23439 


400289 


X07820 


Hs.2258 


443348 AW873596 Hs.1 82278 


424086 


AI351010 


Hs.102267 


400295 


W72838 


Hs.2533 


408771 


AW732573 Hs.47584 


421155 


H87879 


Hs.102267 


424905 


NM_002497Hs.1 53704 


438167 


R28363 


Hs.24286 


459583 


A1907673 




423945 


AA410943 




445263 


H57646 


Hs.42586 



Predicted Protein Domains 

SS,p450 

deam,ZU5JM,Acth/in_recp,pkinase, 
SSJM 

SS,Dihydroorotase, 
PhosphodlestSomatomed in„B. 



UnigeneTltle 

ESTs, WeakJy similar to CP4Y_HUMAN CYTOC 
BMP-R1B 

short-chain alcohol dehydrogenase family 
ESTs 
ESTs 



SS,TM l disintegrin,Pep_M12B_propep l Reprol a disintegrin and metalloproteinase doma 



DSPc,Rhodanese 
SSJM.Cation.effUJx 
ig,tspJ,SS,AAA 

horrnone_rec t Androgen_recep,2f-C4 
SS 

pkinase, 

SS,TM,p450,p450 



dual specificity phosphatase 4 
ESTs 

cartilage intermediate layer protein, nu 
androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 



SS TM,aminotranJ _2,Cadherin„CJerm,cadhtyrosine aminotransferase 
SS,p450,SS,TM jP 450 C1001134:glt2117372lpir|ll65981 fatty ac 

SS,Acetyltransf2, N-acetyltransferase 1 (arylamine N-acety 

pkinase, paternally expressed 1 0 

SS,r^pexin,Peptia^se_M10,SS l Peptidas^„iratrix metalloprotelnase 1 (MrvP1; Inters 

kynurenine 3-monooxygenase (kynurenine 3 
SS p Peptidase_M1 t ESTs 

hemc)pexin,Peptidase„M10,SS ( Peptidase„M10matrixmetalloprote 10 (MMP10; str 
SS.DENN calmodulin 2 (phosphorylase kinase, delt 

SS,Lys>'Lcxidase fysyJ oxidase 

S3 aldehyde dehydrogenase 9 family, member 

TM,K^tetra,'ion_trans I SS,TM I Kjetra,ionJ potassium voitage-gated channel, delayed 
SS.Lysyl ^xidaseAklose.epim.Epirnerase.S lysyl oxidase 
pkinase t SSJM,pkinase t polyprenyLsynt, NIMA (never in mitosis gene a}-related k 



SS,TM,7tmJ,p450,rrm 
pkinase, 

death,ZU5,TM ,Activin__recp .pkinase, 
SS.Acyftransferase, 



ESTs 

gb:lL-BT152-080399-004 BT152 Homo sapien 
gb:zt32h03.r1 Soares ovary tumor NbHOT H 
K1AA1560 protein 



R1 

65.7 
55.9 
53.8 
39.3 
34.9 
25.7 
24.9 
24.1 
217 
21.7 
20.3 
19 2 
18.3 
18.1 
17.8 
16.7 
16.5 
15.7 
15 3 
13.9 
13.5 
13.0 
128 
127 
11.9 
11.8 
117 
11.5 
11.5 
11.4 
11.2 



315 



WO 02/059377 



PCT/US02/02242 



423887 
429859 
428368 
418912 
451952 
402408 
415539 
406687 
419948 
400285 
408380 
409203 
424902 
431725 
41B092 
427811 
423554 
426261 
413374 
432677 
456986 
453331 
407721 
418004 
410555 
443695 
423545 
401045 

442082 
418836 
404091 
450865 
424085 
449650 
432304 
415669 
442942 
428795 
428479 
400300 
447359 
402230 
424687 
432328 
432239 
400286 
425247 
434737 
439310 
441111 
452355 
427711 
429353 
425325 

448706 
403593 
432777 
446232 
411165 
414117 
411096 
450506 
417975 
447752 
427122 



AL080207 Hs. 134585 
NM__007050Hs.225952 
BE440042 Hs.83326 
NM_000685Hs 89472 
AL120173 Hs.301663 
NA 

AJ733881 
M31126 
AB041035 
NA 

AF1 23050 



Hs.72472 
Hs 272620 
Hs.93847 

Hs 44532 



SSJM t BRCT,ank,ABC_tran 1 ABC_tran DKFZP434G232 protein 

SS JM,Y_phosphatase,MAM,fn3, protein tyrosine phosphatase, receptor t 

SS, Peptidase JvllO.rtemopexin.SS.PeptldasejTiatrix meialloproteinase 3 (stromelysin 

SSJM,7tmJ,SS,TM 1 7trrLl, angiotensin receptor 1 

SS.pkinase, ESTs 

SS,carb_anhydrase NM_030920*:Homo sapiens hypothetical pro 

death,ZU5JM,Activin_recp,pkinase ( BMP-R1B 

SS ( Peptidase_M10,hemopexin t SS,Peptkiase_rTiatrix metalloproteinase 11 (MMP11; stro 



AA780473 Hs.687 
NMJX)3866Hs 153687 
X65724 Hs2839 
R45154 Hs 106604 
M81057 Hs 180884 
M90516 Hs1674 
AW242243 Hs 168670 
NM.001034HS 75319 
NM_004482Hs 278611 
D38299 Hs.170917 
AI240665 Hs 8895 



Ferric_reductTM,Ferrtc_reduct, 
TM t ABCJran,ABC_membrane l 
SS J TM,ubfcfurtin,7trrL3 l ANF_recepk3r,sushi 
55,0450,55,0450 
SS,SS 

SS,Cys_knot,SS 
death ) ZU5,pkinase ( Activin_recp, 



NM_016931:Homo sapiens NADPH oxidase 4 ( 
Eos Control 
dil/biquftin 

cytochrome P450, subfamily IVB, polypept 
inositol polyphosphates-phosphatase, ty 
Nonie disease (pseudoglioma) 
ESTs 



Y 12735 
U37519 
U92649 



Hs 38018 
Hs.87539 
Hs 64311 



AW204O99 Hs 337720 
AP000692 Hs.129781 



C11001883*:gi|6753278|ref|NPJ)33938.1[c 



SS 1 Zn„wrrX)pept l Propep_M14 l SS ( Propep_M14carboxyr^ptidase B1 (tissue) 
GATaseJ.SIS, glutamlne-fructose-6-phosphate transamin 

SS,TM,WD40 t ubiquttin,E1-E2 J\TPase,Cati on peroxisomal famesylated protein 
SS ribonucleotide reductase M2 polypeptide 

SS,TM,Glycos_transL2 t Ricin_BJectin 1 DPP UDP-N^cetyl-alpha-D-galactosamtne:polyp 
SS,TM,7tm_1, prostaglandin E receptor 3 (subtype EP3) 

SS,TM,disintegrin,Pep_M12B „propep,Reprol ESTs 

pkinase, dual-specificity tyrosine-(Y)-phosphoryl 

SS.aldedh.SS.aldedh, aldehyde dehydrogenase 3 family, member 

TM,disintegrin f Reprolysin, a dlsintegrin and metalloproteirtase doma 

ESTs, Weakly similar to AF 1 26780 1 retin 
GAF,PDEase chromosome 21 open reading frame 5 

lCE_p20,SS,ICE_p10 t ICE j)20,ICE J>10,lCE_p 



R41823 Hs7413 
AI655499 Hs 161712 
NA 

A1248013 Hs 106532 
NMJ)02914Hs 139226 
AF055575 Hs 23838 
AA932186 Hs 69297 
NM 005025HS 78589 
AW167087 Hs 131562 
R45503 Hs 97469 
Y00272 Hs 184572 
X03363 

NM_012093Hs 18268 
NA 

J05070 Hs 151738 
A1572739 Hs. 195471 
X81334 Hs.2936 
NA 

NM 005940HS 155324 
AA828246 Hs 291884 
AFO86120 Hs 102793 
AI806867 Hs 126594 
N54926 Hs 29202 
M31659 Hs 180408 
AL117406 Hs.200102 
X52730 Hs1892 

phenylethancrfamine N-methyrtransferase 
AW291095 Hs 21814 SS.TM.pkinase, 
NA . 

AA564991 Hs.269477 
A1281848 Hs.194691 
NM_000169Hs.69089 
W88559 Hs.1787 
U80O34 Hs.68583 
NM_004460Hs.418 
AA641836 Hs.30085 
M73700 Hs.105938 
AW057736 Hs.323910 



TM,EPH_lbd,pkinase,SAM,fn3, 
TM,Adivin_recp,pkinase,death^U5 ) 
TM,7tm_3,ANF_receptor, 
zf-C2H2 

SS,AAA,ViraLhelicase1 ,rrm, 
TMjonJrans,SSJM,ion_trans, 
TM,7tm_1, 
SS.serpin, 

SSjg.Sema.pkinase, 
SS.TM 

SS,pkinase ( pkinase 



67 

ESTs; calsyntenin-2 
ESTs 

Target Exon 

ESTs, Weakly similar to I38588 reverse t 
replication factor C (activator 1) 2 (40 
calcium channel, voltage-dependent L ty 
ESTs 

serine (or cysteine) proteinase inhfbito 
ESTs 

ESTs, Highly similar to A39769 N-acetyll 
cell division cycle 2, G1 to S and G2 to 



SS 1 TM,pkinase,Recep_L_domain,SH2 1 PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
SS.adenylatekinase, adenylate kinase 5 

SS,TM,p450, Target Exon 

SSPeptidase_M10 Jn2,herT»pexin 1 SS,TM,Pepmatrix metalloproteinase 9 (gelatinase B 
6PF2K,PGAM, 6-phosphofructo-2-kinase/fructose-2 ( 6-bi 
SS, Peptidase^ 10,hemor^xin,SS,PeptidasejTiatrix metalloproteinase 13 (collagenase 
SS,TM,ABC_tran,ABC„membrane,SS C1600O922:gi|74991O3|pirl|T20903 hypothe 
SS.PeptidaseJvllO.hemopexin.SS matrix metalloproteinase 1 1 (MMP1 1 ; stro 

UCH-1 l pkinase,OPR ) Rhodanese,AMP-binding,ESTs 
SS,TM,UDPGT,casein_kappa ESTs 
SS.TM.Phosphodiest, ESTs 
TM,7tm_1 ,TM G protein- coupled receptor 34 

SS sotote carrier family 25 (mitochondrial 

SS JMABCjran ABC_membrane, ATP-binding cassette transporter MRP8 

SS 1 NNfvn"„PNMT_TEMT 1 SS ( NNMT_PNMT_TEfvf7,STAR 

5.1 

interleukin 20 receptor, alpha 
CIDE-N,pkinase Target Exon 

alpha-amylase, ESTs 
SS,TM,7tm_3,RibosomaLL13 retinoic acid induced 3 

SS 1 Melibiase,BTK,PH,pkinase,SH2 1 SH3,Ribo galactosidase, alpha 
T^iorijrans.KJetra, proteo Op id protein 1 (PeUzaeus-Merzbac 

Peptidase_M3, mitochondrial intermediate peptidase 

SS,DPPIV_N - terrTi,Peptjdase_S9 1 SS,DPPIV_NJjbroblast activation protein, alpha 
SS.trypsin hypotheticaj protein FLJ23186 

SS,transfenin,7trrM .transferrin, lactotransferrin 
SS,TM,pkinase ) Recep_L_domain,SH2 1 PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
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Hs.78915 
Hs.1406 



400181 NA 

452093 AA447453 Hs.27860 
435542 AA687376 Hs.269533 
417576 AA339449 Hs.82285 
446089 AI860021 Hs.270651 
424420 BE614743 Hs. 146688 
452190 H26735 Hs.91668 
419986 A1345455 
421582 A1910275 
446733 AA863360 Hs.26040 
453060 AW294092 Hs.21594 
400205 NA 
420854 AW296927 
432690 AF181490 Hs.278627 
441560 F13386 HsJ888 
416445 AL043004 Hs.79337 
439024 R96696 Hs.35598 
432882 NM_013257Hs.279696 
447754 AW073310 Hs.163533 
453775 NM_002916Hs.35120 
431657 A1345227 Hs.105448 
427899 AA829286 Hs.332053 
422938 NM_0018O9Hs.1594 
418478 U38945 Hs.1174 
406922 S70284 
439285 AL133916 Hs.172572 
429922 Z97630 Hs.226117 
420139 NM_005357Hs.95351 
425071 NM.0139B9HS.1 54424 
424511 BE300512 Hs.193557 
428722 U76456 Hs.190787 
428330 L22524 Hs.2256 
414831 M31158 Hs.77439 
413589 AW452631 Hs.313B03 
418629 BE247550 Hs.86859 
413453 AA129640 Hs.128065 
403943 

444618 AV653785 Hs. 173334 
408761 AA057264 Hs.238936 
427809 M26380 Hs. 180878 
418203 X54942 Hs.B3758 
454071 AKM1793 Hs.42502 
424676 Y08565 Hs.151678 
457465 AW301344 Hs.122908 
417601 NM 014735HS.82292 
446619 AU076643 Hs.313 
406625 Y13647 Hs.1 19597 
428769 AW207175 Hs.106771 
426310 NM_000909Hs.169266 
417531 NM_003157Hs.1087 
444781 NMJ144O0HS.1195O 
431493 AI791493 Hs.129873 
428966 AF059214 Hs.194687 
414175 AI308876 Hs.103849 
455325 AW895719 
429597 NM.003B16Hs.2442 
425320 U29344 Hs.83190 
431854 AA383550 Hs.271699 
407104 S57296 Hs.323910 
449051 AW961400 Hs.333526 
434398 AA121098 Hs.3838 
454042 H22570 Hs.172572 
407756 AA1 16021 Hs.38260 
401464 AF039241 Hs.9028 
412970 AB026436 Hs.177534 
412049 N53437 Hs.18268 
425776 U25128 Hs.159499 



SS TM 3Beta_HSD ENSP0000C171555:CDNA FU10727 fis, ctone 

SSJM.Vtrn J , ' Homo sapiens mRNA; cONA DKFZp586M0723 (f 

SslpkinaseJ*haGEF,ig 1 PH 1 SH3. ESTs 

AlRS,formylJransf,GARS ( SS,GARS,AlRS 1 for phosphoribosylglydnamide formyttransfer 



pkinase 

SS.TM.WAPEG, 

SS,TM,PH,SH2^urin4ike,pkinase,Recep_L. 
pkinase.OPR, 

SS t trefoil,SS i ™,ldLrecepLa,SRCR,tryps 
SS.TM.p450, 

SS.ras.Y^phosphatase.ras 

SSJM,Peptidase_M1, 
SS,pyr„redox,SS,RibosomaLL39 
pkinase, 
SS,pkinase, 
SS,TM,trypsin,vwd ( ig 
pkinase, pkinase_C, 
pkinase, 



ESTs, Moderately similar to A47582 B-cel 
prostaglandin E synthase 
Homo sapiens clone PP1498 unknown mRNA 
GA-biruj'mg protein transcription factor, 
trefoil factor 1 (pS2) 

ESTs, Weakly similar to fatty acid omega 
hypothetical protein MGC 15754 
NM_006265*:Homo sapiens RAD21 (S. pombe) 
gb:UI-H-BWO-ajCHC-07-0-UU1 NCI_CGAP_Su 
prenylcysteine lyase 

Homo sapiens ctone 23736 mRNA sequence 

K1AA0135 protein 

ESTs 

serunVglucocorticoid regulated kinase-li 
Homo sapiens cDNA FU14142 fis, clone MA 



SS AAA.PI3_PW_kinase.Pt3Ka.Pl3K_rbd.Pl3 replication factor C (activator 1 ) 4 (37 
SSJM.pkinase ESTs, Weakty similar to B34087 hypotheti 

Ss!sAAj>roteins 1 ABC„rnembraneABC_tran 1 serum amyloid A1 



SS.TM.thiolase, 
ank,ank 

SSJM t Desaturase,SS 
SS.ig.pkinase.LRRNT.LRRCT, 
SS.TMJinkerJVtstoneJtmJ 
SS,TM,p450, 

SS,T4_deiodinase,T4_deiodinase, 
SS,Y_phosphatase,BandJ1 
SS.T1MP, 



centromere protein A (17kD) 
cyclin-<iependent kinase inhibitor 2A (me 
gb;stearoyl-CoA desaturase [human, adipo 
hypothetical protein FLJ20O93 
H1 histone family, member 0 
lipase, hormone-sensitive 
delodinase, ioobthyronine, type tl 
ESTs, Moderately similar to ALU7JHUMAN A 
tissue inhibitor of metalloproteinase 4 



SS,Peptidase_M10,SS,Peptidase_M10,hemopematn'x metalloproteinase 7 (MMP7; uterin 
S$]cNMP„binding,Rlla,HMGJ>ox protein kinase, cAMP-dependent, reguiato 

Ss'abhydrolase ESTs, Highly similar to AF1 57833 1 noncl 

Ss!sH2,PH,$S 1 TM,PH,SH2,Furin-ltke,pkinas growth factor receptor-bound protein 7 



ESTs 

C50O0355:gi|4503225|ref|NP_0OO765.1| cyt 
ELL-RELATED RNA POLYMERASE II, ELONGATIO 
ESTs, Weakly similar to (defline not ava 
lipoprotein lipase 
COC28 protein kinase 2 
ESTs 

UDP-N-acetyl-alpha-D-g alactosamine: polyp 
DNA replication (actor 
KIAA0215gene product 
secreted phosphoprotein 1 (osteopontin, 
stearoyl-CoA desaturase (detta-9-desatur 
ESTs 

neuropeptide Y receptor Y1 
serine/threonine kinase 2 
GPI-anchored metastasis-associated prate 
ESTs, Weakly simflar to A36036 cytochrom 



SS,Peptidase_C1.gpdh 
p450,SS,p450 

SS,TM,7tm_1, 
SS,lipase,PLAT,Sec7,PH, 
CKS,SS,CKS, 
TM,7tmJ, 

Glycos_transf_2,Ricin_B_lectin ( SS t Glyoos 
SS.Pribosyltran.Sulfatase 
PHD.pkinase.SS 
SS.TM.efhand/ion^trans 
SS.TM.Desaturase.SS 
SS,7tmJ,SPRY, 
SS,TM,7tmJ, 

$S,pkinase,vwa,vwa,G tycojransfji 
SS,PHJactamase_B 
SS,p450,p450 

cholesterol 25-hydroxylase 
TM,hemopexin,Peptidase_M10,r^rjexin,Pephyrjothetica protein DKFZp761Dl12 
TM,ion Jrans,KJetra, gb:QV4-NNGO39-290300-1 54-f06 NN0039 Homo 

a disintegrin and metalloproteinase doma 
AcyLtrarisf,adh_2inc,ketoa<Yi-synt,pr>bj fatty acid synthase 
IM$,SS polymerase (DNA directed) iota 

SS,TM,SH2 I PH,p|ynase 1 Recep_L„domain,FLjriv-erb-b2 avian erythroblastic leukemia v 



SS 

pkinase ) POLO_box,SS.pkinase,POLO_box, 

SS,ig f pkinase,LRRNT,LRRCT J 

SS, UCH-1 ,UCH-2,SS,TM,G jglu Jranspept 

Peptidase„M24, 

DSPc,Myosin_tail, 

SS.adenylatekinase, 

SS,TM.7tm_2,SS,TM.7tm_2 
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HER2 receptor tyrosine kinase (c-erb-b2, 
serum-inductble kinase 
hypothetical protein FU 20093 
ubiquitin specific protease 18 
histone deacetylase 5 
dual specificity phosphatase 10 
adenylate kinase 5 
parathyroid hormone receptor 2 



4.6 

4.6 

4.6 

4.6 

4.6 

4.5 

4.5 

4.5 

4.5 

4.4 

4.4 

4.4 

4.3 

4.3 

4.3 

4.3 

4.3 

4.2 

4.2 

4.2 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.0 

4.0 

4.0 

3.9 

3.9 

3.9 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.7 

3.7 

3.7 

3.7 

3.7 

3.7 

3.7 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3,5 

3.5 

3.4 



WO 02/059377 



PCT7US02/02242 



407846 


AA426202 Hs.40403 


TMABC_membraneABC_tran,RibosorriaLS4eCbp/p300-interacting transactivator, wit 


34 


406925 


L34041 Hs.9739 


SS JM^ansport^rotSWIB.RhoGAP^AG^PE-glycerol-S-phosphate dehydrogenase 1 (so 


34 


445873 


AA250970 Hs.251946 


S$,rrm,PABP, pkinase, 14-3-3,rrm 


poiy(A)- binding protein, cytoplasmic 1-1 


34 


418054 


NM_002318Hs.83354 


SSJHmito carr.LysyLoxidase 


tysyl oxidase-like 2 


34 


406815 


AA833930 Hs.288036 


SSJPPT, 


tRNA isopentenylpyrophospbate transferas 


3.4 


410530 


M25809 Hs 64173 


ATP-syntab,SS,7tmJ »ATP-synLab 


ATPase, H transporting, lysosomal (vacuo 


34 


407021 


U52077 




gb:Human marinerl transposase gene, comp 


34 


421168 


AF1 82277 Hs 330780 


S$,p450,SS 


cytochrome P450, subfamily 1IB (phenobar 


34 


431473 


M825686 Hs 321 176 


SS 


ESTs, Weakiy similar to S65824 reverse t 


34 


408101 


AW968504 Hs 123073 


pkinase, 


CDC2-related protein kinase 7 


34 


422083 


NM_001141Hs 111256 


lipoxygenase, PLAT , 


arachidonate 15-lipoxygenase, second typ 


33 


411393 


AW797437 Hs.69771 


SS.sushi.trypsin.vwa.rrm.fibrinogen^Cfn 


B-factor, properdin 


33 


435767 


H73505 Hs 117874 


SS.PeptidaseJSS.PPeptidase^SS.P 


ESTs 


33 


433068 


NM 006456HS 288215 


SS.Pribosyttran, 


sialy [transferase 


33 


426928 


AF037062 Hs 172914 


SS,adh„shortTGF-beta,TGFb_propeptide 


retinol dehydrogenase 5 (1 Wis and 9-d 


33 


414575 


H11257 Hs 22968 


SS.pkinase.ig, 


Homo sapiens clone IMAGE451939, mRNA se 


33 


445941 


AI267371 Hs.172636 


SS.SSJipoxygenase.PLAT 


ESTs 


33 


444542 


AI161293 Hs 280380 


SS,SS,Peptidase_M1 I EGF,ig l )ectin_c,sushi 


aminopepfjdase 


33 


425741 


AF052152 Hs 159412 


pkinase, 


Homo sapiens clone 24628 mRNA sequence 


33 


434228 


Z42047 Hs 283978 


SS.TMjtmJ 


Homo sapiens PR02751 mRNA, complete cds 


33 


433264 


D85782 Hs3229 




cysteine dioxygenase, type I 


33 


400419 


AF084545 


SS,Peptidase_M1, 


Target 


33 


439750 


AL359053 Ha 57664 


TM,integrin_B ( Ricin_B_lect)n t rrm 


Homo sapiens mRNA full length insert cDN 


33 


417757 


R19897 Hs 106604 


death ,ZU5,pWnase,Activin_recp, 


ESTs 


33 


452194 


AI694413 Hs 332649 


SS,TM,7tm 3.ANF receptor, sushi 


olfactory receptor, family 2, subfamily 


32 


421458 


NM 003654HS 104576 


SS 


carbohydrate (keratan sulfate Gaf-6) sul 


32 


443767 


BE562136 Hs 9736 


SS.PC 1 , RasG EF ,hormone_rec,zf-C4 , 


proteasome (prosome, macropain) 26S subu 


32 


422648 


D86983 Hs. 118893 


peroxidase, LRRCT, 


Melanoma associated gene 


32 


423431 


AA326062 


SS,p450,p450 


gb:EST29171 Cerebellum II Homo sapiens c 


32 


451264 


AI768235 


SS.Trehalase 


gbwg82g08.x1 Soares_NSF_F8_9W_OT_PA_P_S 


32 


452110 


T47667 Hs 28005 


SS,TM,Activin_recp, pkinase 


Homo sapiens cDNA FLJ11309 fis, clone PL 


3.2 


439963 


AW247529 Hs 6793 


TM,p450,Ets 


platelet-activating factor acetylhydrola 


32 


453941 


U39817 Hs36820 


SS,DEAD,HRDC,helicase_C ( 


Bloom syndrome 


31 


406664 


L34041 Hs.9739 


SS ) TM,transpofLprot f SWIB l RhoGAP l DAG_PE- 


gfycerol-3 


phosphate dehydrogenase 1 (so 


3.1 




3.1 


453487 


R31770 Hs.23540 


TM,7tmJ, 


ESTs 


420911 


U77413 Hs.100293 




CMinked N-acetylglucosamine (GlcNAc) tr 


3.1 


443171 


BE281128 Hs.9030 


SS,TM,7tm_1,rTm,SS 


TONDU 


31 


452256 


AK000933 Hs28661 


TM.GDIJtmJ, 


Homo sapiens cDNA FU 10071 fis, clone HE 


31 


432201 


Ai538613 Hs 298241 


SS,TM,trypsln,SS,TM,trefoil ( trypsin,tref 


Transmembrane protease, serine 3 


31 


419150 


T29618 Hs.89640 


TM,pklnase,fn3, 


TEK tyrosine kinase, endothelial (venous 


3.1 


444443 


AI149286 Hs55099 


SS 


rab6 GTPase activating protein (GAP and 


31 


426283 


NM 003937HS 169139 




kynureninase (L-kynurenine hydrolase) 


31 


436291 


BE568452 Hs 5101 


SS.abhydrolase, 


protein regulator of cytokinesis 1 


31 


450223 


AA418204 Hs 241493 


SS.pro isomerase, 


natural killer-tumor recognition sequenc 


3.1 


424269 


AW1 37691 Hs 199754 


SS,TM,7tm_2,GPS 


ESTs 


3.1 


448105 


AW591433 Hs.298241 


SSJM.trefoil.trypsin, trefoil 


Transmembrane protease, serine 3 


3.0 


452560 


BE077084 Hs 336432 


S$,rm,zf-RanBP 1 pkinase,C2 i pkinase_C,DAG ESTs 


3.0 
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TABLE 21A 

Table 21 A shows the accession numbers for those pkeys lacking unigenelD's for Table 21 . 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number. Gene duster number 
Accession: Genbank accession numbers 



Pkey 


CAT number 


Accessions 


420854 


197072 1 


AW296927 AI684514AI263168 AA281079 


423431 


228162 1 


AA326062 AA3257 58 AW962182 


423945 


233566J 


AA41 0943 AW948953 AA334202 AA332882 


451264 


863988 1 


AI768235 R31400 H29082 H23107 


455325 


1279475 1 


AW895719N31451 N41451 


456207 


165078.-1 


AA193450 
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TABLE 21B 



Table 21B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 21. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gi) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntposition: I ndicates nucleotide positions of predicted exons, 



Pkey 


Ref 


Strand 


Nt_posttion 


401045 


8117619 


Plus 


90044-90184,91111-91345 


402230 


9966312 


Minus 


29782-29932 


402408 


9796239 


Minus 


110326-110491 


402578 


9884928 


Plus 


66350-66496 


403593 


6862650 


Minus 


62554-62712.69449-69602 


403943 


7711864 


Plus 


100742-1009O4.101322-1015O3 


404091 


7684554 


Minus 


82121-83229 
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TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85 th percentile value for 12 non-malignant breast specimens, and the 
96 th percentile value amongst the 73 breast cancers was greater than or equal 100 units. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Trtte; Unigene gene title 

r 1 ; Ratio of 90 th percentile tumor to 85* percentile normal breast tissue 



Pkey ExAccn UnigenelD UnigeneTitle R1 

400292 AA250737 Hs.72472 BMP-R1B 51.5 

424735 U31875 Hs 272499 short-chain alcohol dehydrogenase family 38.3 

400297 AI127076 Hs 334473 hypothetical protein DKFZp56401 278 29.9 

431448 AL137517 Hs 334473 hypothetical protein DKFZp56401 278 26.9 

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformations 25.8 

431211 M86849 Hs,323733 gap junction protein, beta 2, 26kD (conn 23.2 

418203 X54942 Hs 83758 CDC28 protein kinase 2 22.6 
407980 AA046309 gb:zf12K)1.s1 Soares_fetal_hearLNbHH19W 19.8 

414646 M353776 Hs,901 CD48 antigen (B-cell membrane protein) 18.9 

446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0 

409041 AB033025 Hs.5O081 KIAA1 199 protein 17.6 

412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 17.6 
407824 AA147884 Hs.9812 Homo sapiens cON A FLJ1 4388 fis, clone HE 17.1 

453160 AI263307 Hs 239884 H2B histone family, member L 17.0 

407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 16.1 

425692 D90041 Hs 155956 N-acetyltransf erase 1 (aryiamine N-acety 16.1 

438533 AI440266 Hs. 170673 ESTs, Weakly similar to T24832 hypotheti 16.0 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 15 5 

444342 NM_014398Hs.10887 similar to tyscsome-associated membrane 15.1 

422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14,9 

430515 AA746503 Hs.283313 ESTs 14.7 

417308 H60720 Hs.81892 KIAA01 01 gene product 14 4 
452744 AI267652 Hs.30504 Homo sapiens mRNA; cONA DKF2p434 E082 (fr1 4.4 

412446 AI768015 Hs.92127 ESTs 14 2 

415539 AI733881 Hs.72472 BMP-R1B 14.1 

435496 AW840171 Hs.265398 ESTs, Weakly similar to transformations 13.8 

438209 AL1 20659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 13.8 
400205 NA NM_006265*:Homo sapiens RAD21 (S. pombe)13.5 

430965 AA489732 Hs.154918 ESTs 13.4 

415263 AA94B033 Hs.130853 ESTs 13.3 

451952 AL120173 Hs.301663 ESTs 13 -2 

449722 BE280074 Hs.23960 cyclin B1 13.2 

406685 M18728 g b:Hu man nonspecific crossreacting a ntig 13.0 

406690 M29540 Hs.220529 carcinoembfyonlcenegen-relateda^lad 12.8 

429925 NM_000786Hs.226213 cytochrome P450, 51 (lanosterol 14-alpha 12.8 

416498 U33632 Hs.79351 potassium channel, subfamily K, member 1 12.7 

432378 AI493046 Hs.1 46133 ESTs 12- 5 

441377 BE218239 Hs.202656 ESTs 12 5 
456207 AA193450 gb:zr40e07.r1 Soares^ NhHMPu_S1 Homo sapi12.4 

422805 AA436989 Hs.121017 H2A histone family, member A 12.2 

407811 AW190902 Hs.40098 cysteine knot superfamtly 1 , BMP antagon 12.2 

407178 AA195651 Hs,104106 ESTs 122 
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420931 AF044197 Hs 100431 small Inducible cytokine B subfamily (Cy 12,1 

421727 Y13153 Hs 107318 kynurenine 3-monooxygenase (kynunenine 3 12.0 

434408 AI031771 Hs. 132586 ESTs 12.0 

446591 H44186 Hs 15456 PDZ domain containing 1 11.9 

5 431385 BE17 8536 Hs 11090 membrane-spanning 4-domains, subfamily A 11.8 

443348 AW873596 Hs 182278 calmodulin 2 (phosphorylase kinase, deft 11.7 

416602 NMJ)Q6159Hs 79389 nei (chtcken>like 2 11.7 

433365 AF026944 Hs 293797 ESTs 11.6 

437866 AA1 56781 Hs 74170 metaBothionein 1 E (functional) 11.5 

10 412472 AW975398 Hs. 293836 ESTs 11.4 

416030 H15261 Hs.21948 ESTs 11.3 

439979 AW6O0291 Hs.6823 hypothetical protein FU 10430 11.3 

420757 X78592 Hs 99915 androgen receptor (dihydrotestosterone r 11.3 

411598 BE336654 Hs 70937 H 3 histone family, member A 11.2 

15 423600 AI633559 Hs 310359 ESTs 11.2 

430770 AA765694 Hs 123296 ESTs 11.0 

421037 AI684808 Hs 197653 programmed cell death 9 (PDCD9) 10.9 

452461 N78223 Hs. 108106 transcription factor 10.7 

409269 AA576953 Hs 22972 hypothetical protein FU 133 52 10.6 

20 417791 AW965339 Hs.111471 ESTs 10.6 

447268 AI370413 Hs 36563 hypothetical protein FU224 18 1 0.4 

424001 W67883 Hs. 137476 paternally expressed 10 10.4 

447342 AI199268 Hs 19322 Homo sapiens, Similar to RIKEN cDNA 2010 10.4 

424905 NM_002497Hs 153704 N I MA (never in mitosis gen ea)-relatedk 10.1 

25 453619 H87648 Hs 33922 Homo sapiens , clone MGC: 9084, mRNA, compl 0.1 

442942 AW167087 Hs 131562 ESTs 10.1 

434377 AW137148 Hs 306593 Homo sapiens cDNA FLJ11382fis, clone HE 10.1 

427217 AA399272 Hs 144341 ESTs 10.1 

445730 AI624342 Hs. 170042 ESTs 10.0 

3 0 432887 AI926047 Hs 162859 ESTs 10.0 

452243 AL355715 Hs 28555 programmed cell death 9 9.9 

424590 AW966399 Hs 46821 hypothetical protein FU 20 086 9.9 

432169 Y00971 Hs 2910 phosphoribosyl pyrophosphate synthetase 9.9 

438950 H23789 Hs 144530 EST 9.9 

3 5 418836 AI655499 Hs 161712 ESTs 9.8 

430291 AV660345 Hs 238126 CGl-49 protein 9.8 

444665 BE613126 Hs 47783 B aggressive lymphoma gene 9.7 

407377 C16391 gb:C16391 Clontech human aorta polyA mRN 9.7 

445413 AA151342 Hs 12677 CGI-147 protein 9.7 

40 443462 AI064690 Hs 171176 ESTs 9.7 

442145 AI022650 Hs8117 erbb2-interacting protein ERBIN 9.7 

435570 AF212222 Hs 177812 uncharacterized bone marrow protein BM04 9.7 

439820 AL360204 Hs 283853 Homo sapiens mRNA full length insert cDN 9.6 

428966 AF059214 Hs 194687 cholesterol 25-bydroxylase 9.6 

45 449448 D60730 Hs 57471 ESTs 9.6 

433929 AI375499 Hs 27379 ESTs 9.5 

432731 R31178 Hs 287820 fibronedin 1 9.3 

411815 AA156679 Hs.125790 leucine-ricb repeat-containing 2 9.3 

415385 R17798 Hs.7535 COBW-like protein 9.3 

50 422026 U80736 Hs 110826 trinucleotide repeat containing 9 9.2 

432596 AJ224741 Hs 278461 matrilin 3 9,2 

439451 AF086270 Hs 278554 heterochromatin-like protein 1 9.2 

423945 AA410943 gb:zt32h03.r1 Soares ovaty tumor NbHOT H 9.1 

442432 BE093589 Hs 38178 hypothetical protein FU23 468 9.1 

55 446715 AI337735 Hs 173919 ESTs, Moderately similar to ZN9LHU MAN Z 9.0 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0 

437021 AI076089 Hs 292239 ESTs 9.0 

428479 Y00272 Hs 184572 ceB division cycle 2, G 1 to S and G2 to 8.9 

428839 AI767756 Hs 82302 Homo sapiens cDNA FU 148 14 fis, clone NT 8.9 

60 402408 NA NM_030920*: Homo sapiens hypothetical pro 8.9 

418601 AA279490 Hs.86368 calmegin 8.8 

426327 W03242 Hs.44898 Homo sapiens clone TCCCTA001 51 mRNA sequ8. 8 

419519 AM98719 Hs.176376 ESTs 8.8 

440621 AW296024 Hs.150434 ESTs 8.8 

65 446142 AI754693 Hs.145968 ESTs 8.8 

418196 A1745649 Hs.26549 KIAA1 708 protein 8.7 

447178 AW594641 Hs.192417 ESTs 8.7 
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427535 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 8.6 

415857 AA366115 Hs.127797 Homo sapiens cDNA FU11331 fis, done HE 8.6 

435061 AI651474 Hs.163944 ESTs 86 

431374 BE258532 Hs.251871 CTP synthase 8.4 

5 417866 AW067903 Hs.82772 collagen, type X!, alpha 1 8 4 

437211 AA382207 Hs.5509 ecotropic viral integration site 2B 8 3 
437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU 1 .HUMAN A 8 3 

423887 AL080207 Hs.134585 DKFZP434G232 protein 82 

440941 BE268362 Hs.7535 COBW-like protein 8.2 

10 429859 NM_007050Hs.225952 protein tyrosine phosphatase, receptor t 8 2 

410193 AJ132592 Hs.59757 zinc finger protein 281 8.2 

431725 X65724 Hs.2839 Nome disease (pseudogfoma) 8.1 

446258 AI283476 Hs.263478 ESTs 81 

416747 AW876523 Hs. 15929 hypothetical protein FU 12910 8.1 

15 434424 AI811202 Hs.325335 Homo sapiens cDNA: FU23523 fis, done L 8.1 

421650 AA781795 Hs.122587 ESTs 8.0 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 8.0 

457465 AW301344 Hs.122908 DNA replication factor 8 0 

427961 AW293165 Hs.143134 ESTs 8 0 

20 436481 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 8.0 

418216 AA662240 Hs.283099 AF15q14 protein 8.0 
418250 U29926 Hs.83918 adenosine monophosphate deaminase (feofo 7.9 

400285 NA Eos Control 7.9 

401464 AF039241 Hs.9028 histone deacetytase 5 7.9 

2 5 407242 M18728 gb:Human nonspecific crossreacting antig 7.8 

422232 D43945 Hs. 113274 transcription factor EC 7.8 

454024 AA993527 Hs.293907 hypothetical protein FLJ23403 7.8 

444542 AI161293 Hs.280330 aminopeptidase 7.8 

436396 AI683487 Hs.1 52213 wingless-type MMTV integration site fami 7.7 

3 0 437204 AL110216 Hs.12285 ESTs, Weakly similar to 155214 salivajy 7.6 

408805 H6991 2 Hs.48269 vaccinia related kinase 1 7.6 

437207 T27503 Hs.15929 hypothetical protein FU 12910 7.6 
442818 AK001741 Hs 8739 hypothetical protein FU 10879 7,6 

426283 NMJX)3937Hs.169139 kyn u re n inase {L-kynurenine hydrolase) 7.5 
3 5 424687 J05070 Hs.151738 matrix metalbproteinase 9 (gelatinase B 7.5 
446315 NM_016293Hs 14770 bridging integrator 2 7.5 
433426 H69125 Hs 133525 ESTs 75 
406639 M97711 gb.Human T-cell receptor (V beta 18.1, J 7.5 
420077 AW512260 Hs.87767 ESTs J.4 
40 457332 AA961694 Hs.105187 kinesin protein 9 gene 7.4 
422938 NM_001809Hs.1594 centromere protein A (17kD) 7.4 
447555 AI391662 Hs.160963 Homo sapiens, clone MGC: 1231 8, mRNA,com7, 4 
444618 AV653785 Hs.173334 ELL-RELATED RMA POLYMERASE II, ELONGATI07.3 
410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3 
45 400268 NA NM_003 292; Homo sapiens translocated prom 7.3 
439509 AF086332 Hs.58314 ESTs 7 \ 

407771 AL138272 Hs.62713 ESTs 7 -3 
407202 N58172 Hs,109370 ESTs 7.3 
433096 AU076803 Hs-282975 carboxylesterase 2 (intestine, Irver) 7.2 
50 422094 AF129535 Hs.272027 F- box only protein 5 ' 7.1 

430832 AI073913 Hs.100686 ESTs, Weakty similar to JE0350 Anterior 7.1 
430287 AW182459 Hs,125759 ESTs, Weakly similar to LEU 5_ HUMAN LEU KE7.0 

423739 AA398155 Hs 97600 ESTs . 7 0 
448212 AI475858 g b;tc37d07.x1 NCLCGAR.CLL1 Homo sapiens7.0 
5 5 407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0 
454440 BE062906 Hs,28338 KIAA1546 protein 7.0 
444783 AK001468 Hs 62180 anilBn (Drosophila Scraps homotog), act 7.0 
421373 AA808229 Hs 167771 ESTs 6.9 
431960 AW241821 Hs.301927 co.1A ^.9 
60 424704 AI263293 Hs. 152096 cytochrome P450, subfamily II J (arachtdo 6.8 
449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8 
439840 AW449211 Hs.105445 GDNF family receptor alpha 1 6.8 
414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8 
441243 AI767056 Hs.193002 ESTs 6.7 
65 408380 AF123050 Hs.44532 diubiquitin 6.7 
422956 BE545072 Hs.122579 hypothetical protein FU 10461 6.7 



HZZ^^V ruw... • r- 

446651 AA393907 Hs.97179 ESTs 6.7 
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419839 U24577 Hs.93304 phospholipase A2 t group V!t (platelet-ac 6.7 

437740 AA810265 Hs.122915 ESTs 6 7 

421582 AI910275 Hs,1406 trefoil factor 1 (pS 2) 6 7 

427356 AW023482 Hs.97849 ESTs 6.6 

5 429597 NM_003816Hs.2442 a d'eintegrin and metalloproteinase doma 6 6 

422634 NM_016010Hs.118821 CGI-62 protein 6 6 

421072 AI215069 Hs 89113 ESTs 6 5 

427718 AI798680 Hs.25933 ESTs 6.5 

411000 N40449 Hs 201619 ESTs, Weakly similar to S38383 SEB4B pro 6 5 

10 449343 AI151418 Hs 272458 protein phosphatase 3 (formerly 2B), cat 6 4 

409757 NM_001898Hs 123114 cystatinSN 6 4 

447164 AF026941 Hs.17518 Homo sapiens cig 5 mRNA, partial sequence 6 4 

456938 X52509 Hs.161640 tyrosine aminotransferase 6 4 

418848 AI820961 Hs 193465 ESTs 6 4 

15 424902 NM_003866Hs 153687 inositol polyphosphates-phosphatase, ty 6 4 

452838 U6501 1 Hs 30743 preferentially expressed antigen in mela 6 4 

439452 AA918317 Hs.57987 B-cell CLL/lymphoma 11B (zinc finger pro 64 

407266 AJ235664 gb:Homo sapiens mRNA for immunoglobulin 6 3 

411078 AI222020 Hs 182364 CocoaCrisp 6 3 

20 433001 AF217513 Hs 279905 clone HQ03 10 PR O0310p1 6 3 

434340 AI193043 Hs 128685 ESTs, Weakly similar to T17226 hypotheti . 6 2 

429503 AA394183 Hs.26873 ESTs 6 2 

402578 C1001134;gi|2117372|pirj|l65981 fatty ac 6 2 

409646 AW161391 Hs 709 deoxycytidine kinase 61 

25 430447 W17064 Hs 332848 SWI/SNF related, matrix associated, acti 61 

432415 T16971 Hs 289014 ESTs, Weakly similar to A439 32 mucin 2 p 61 

443709 AI082692 Hs 134662 ESTs 61 

420929 AI694143 Hs 296251 programmed cell death 4 61 

428368 BE440042 Hs 83326 matrix metal to proteinase 3 (stro me iysin 61 

3 0 428248 AI126772 Hs 40479 ESTs 6 0 

420344 BE463721 Hs 97101 putative G protein-coupled receptor 6 0 

453392 U23752 Hs 32964 SRY (sex determining region Y>box 11 6 0 

425397 J04088 H s. 156346 to poisome rase (DN A) II alpha (170k D) 6 0 

418007 M13509 Hs 83169 matnx metalloproteinase 1 (MMP1; inters 6 0 

3 5 428585 AB007863 Hs.185140 K1AA0403 protein 6 0 

437608 AA761605 Hs 292308 ESTs, Weakly similar to ALU 1_HU MAN ALU S6 0 

427408 AA583206 Hs 2156 RAR -related orphan receptor A 6 0 

406687 M31126 Hs.272620 matrix metal top roteinase 1 1 (MMP1 1 ; stro 6 0 

418092 R45154 Hs 106604 ESTs 6 0 

40 447051 AW139130 Hs 160951 ESTs, Weakly similar to Con1 [H.sapiens] 6 0 

441233 AA972965 Hs 135568 ESTs 6 0 

432239 X81334 Hs2936 matrix metal toproteinase 13 (coflagenase 6 0 

435106 AA100847 Hs 193380 ESTs, Highly similar to AF174600 1 F-box 5 9 

435525 AI83 1297 Hs 123310 ESTs 5 9 

45 458809 AW97251 2 Hs 20985 sin3-associated polypeptide, 30kD 5 9 

410785 AW803341 gb:IL2-UMO07£O903(XM)50-D03 UM0079 Homo5.9 

422576 BE548555 Hs 118554 CGI -83 protein 5 9 

451398 AI793124 Hs.144479 ESTs 5 9 

441881 AW968904 Hs 179566 hypothetical protein FLJ22624 5 8 

5 0 412022 AI005043 Hs 24143 Wiskott-Aldrich syndrome protein in terac 5 8 

416636 N32536 Hs42645 solute carrier family 16 (mono carboxy lie 5 8 

447350 AI375572 Hs 172634 ESTs 5 8 

434094 AA305599 Hs 238205 hypothetical protein PR O20 13 5 8 

409151 AA306105 Hs 50785 SEC22, vesicle trafficking protein (S. c 5 8 

55 448807 AI571940 Hs7549 ESTs 5 8 

452281 T93500 Hs 28792 Homo sapiens cDNA FLJ11041 fis, done PL 5 8 

421281 AI299139 Hs 17517 ESTs 5 8 

430361 AI033965 Hs 239926 sterol-C4- methyl oxidase-like 5 8 

400289 X07820 Hs2258 matrix metalloproteinase 10 (MM P 10; str 5 7 

60 440527 AV657117 Hs 184164 ESTs, Moderately simitar to S65657 alpha 5 7 

434674 AA831879 Hs 136985 ESTs 5 7 

426320 W47595 Hs 169300 transforming growth factor, beta 2 5 7 

452401 NMJ)07115Hs.29352 tumor necrosis factor, alpha- induced pro 5 7 

448663 BE614599 H s. 106 823 hypothetical protein MGC 147 97 5 7 

65 438199 AW016531 Hs 122147 ESTs 5 7 

446203 Z47553 Hs. 14286 flavin containing monooxygenase 5 5 7 

428336 AA503115 Hs. 183752 microseminoproteln, beta- 5.6 
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430379 AF134149 Hs.240395 potassium channel, subfamily K, member 6 5.6 
422835 BE218705 Hs.121378 metaJbthionein-lfte 5, testis-specific 5.6 
444758 AL044878 Hs.1 1899 3'hydroxy-3-methyiglutaryt-Coenzyme A re 5.6 
443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6 
5 400301 X03635 Hs.1657 estrogen receptor 1 5.6 
447078 AW885727 Hs.301570 ESTs 5.6 
432015 AL157504 Hs. 1591 15 Homo sapiens mRNA; cDNA DKFZp586O0724 (f5.5 
438691 AA906288 Hs.212184 ESTs 5.5 
439809 R41396 Hs.101774 hypothetical protein FU23045 5.5 

10 415786 AW419196 Hs.257924 hypothetical protein FU 13782 5,5 
456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 5, 5 
401645 NA C16001440*:gi]12330704|gb|AAG52890.1|AF35 5 

437967 BE277414 Hs.5947 mel transforming oncogene (derived from 5,5 
445885 AI734009 Hs.127699 KIAA1603 protein 5 4 

15 439138 AI742605 Hs. 193696 ESTs 54 
440270 NMJ)15986Hs.7120 cytokine receptor- like molecule 9 5.4 
437536 X91221 Hs.144465 ESTs 54 
438167 R28363 Hs.24286 ESTs 54 
452741 BE392914 Hs.30503 Homo sapiens cDNA FL J 1 1 344 fis, clone PL 54 

20 426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7.HUMAN A 5 4 
413554 AA319146 Hs.75426 secretogranin II (chromogranin C) 54 
422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5 4 
434263 N34895 Hs.44648 ESTs 5 4 

446382 AW205168 Hs.150823 ESTs 5 4 

25 422406 AF025441 Hs.1 16206 Opa- interacting protein 5 5.3 
438321 AA576635 Hs.6153 CGI-48 protein 5 3 

418310 AA814100 Hs.86693 ESTs 5 3 

419625 U91616 Hs.91640 nudear factor of kappa light polypeptid 5.3 
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ 12280 fis, done MA 5,3 

30 445900 AF070526 Hs.1 3429 Homo sapiens clone 24787 mRNA sequence 5.2 
449051 AW961400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 52 
418758 AW959311 Hs/172012 hypoftetical protein DKFZp434J037 5.2 
431070 AW408164 Hs.249184 transcription factor 19 (SC1) 5.2 
417079 U65590 Hs.81134 "mterieukin 1 receptor antagonist 5 2 

35 421928 AF013758 Hs.109643 poiyade ny late binding protein- in teradin 5 2 
428804 AK000713 Hs.193736 hypothetical protein FLJ20706 5.2 
427427 AF077345 Hs.177936 ESTs 5.2 
403485 C3001B13*:gi|l2737279|reflXP_012163.1lk 5 2 

422168 AA586894 Hs.112408 S 1 00 calcium-btnding protein A7 (psorias 5.1 

40 421937 AI878857 Hs.109706 hematological and neurological expressed 51 
426752 X69490 Hs.172004 fitin 5 1 

453310 X70697 Hs,553 solute carrier family 6 (neurctransmitte 51 
423198 M81933 Hs.1634 celt division cyde 25A 51 
412281 AI810054 Hs.14119 ESTs 51 

45 447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 51 
453931 AL121278 Hs.25144 ESTs 51 
404347 Target Exon 5.1 

431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 5.1 
429113 D28235 Hs.196384 prostagl andin-endo peroxide synthase 2 (p 5.1 

50 436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 5.1 
450603 R43646 Hs.12422 ESTs 5.1 
434725 AK000796 Hs.4104 hypothetical protein 5.0 
435981 H74319 Hs.188620 ESTs 50 
407376 AA993138 Hs.142287 ESTs, Weakly similar to ALUF_HUMAN !!!! 5.0 

55 431689 AA305688 Hs.267695 UDP-GaljbetaGlcNAc beta 1,3-galactosyttr 5.0 



405348 NA C7001664:gi|1269806l|db]|BAB21849.1| (AB 5,0 

436196 AK001084 Hs.333498 Homo sapiens cDN A FLJ 10222 fts, clone HE 5.0 



437065 AL036450 Hs.103238 ESTs 5.0 

410196 AI936442 Hs.59838 hypothetical protein FLJ 10808 5.0 

60 429412 NM_006235Hs.2407 POU domain, class 2, associating factor 5.0 

446619 AU076643 Hs,313 secreted phosphoprotein 1 (osteopontin, 4.9 

403329 NA Target Exon 4.9 

442875 BE6230O3 Hs 23625 Homo sapiens clone TCCCTA00142 mRNA sequ4.9 

442441 At820662 Hs.129598 ESTs 4.9 

6 5 430375 AW371048 Hs,93758 H4 histone family, member H 4.9 



424128 AW966163 gb:EST378236 MAGE resequences, MAGI Homo4.9 

408873 AL046017 Hs, 182278 calmodulin 2 (phosphorylase kinase, dert 4.9 
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407910 AA650274 Hs 41296 fibronectin leucine rich transmembrane p 4.9 

432606 NM_002104Hs 3066 g ran zyme K (serine protease, granzyme 3; 4.9 

453204 R10799 Hs 191990 ESTs 4.8 

452020 AA722012 Hs.255757 ESTs, Weakly similar to AT 2AJHUMAN POTEN 4. 8 

5 449048 Z45051 Hs.22920 simitar to S 684 01 (cattle) glucose indue 4 8 

408369 R38438 Hs 182575 solute carrier family 15 {H??? transport 4 8 

431645 AF078849 Hs.266483 dynein light chains 4 8 

423575 C18863 Hs.163443 Homo sapiens cDN A FU 11 576 fe, done HE 4 8 

444246 H93281 Hs 10710 hypothetical protein FU204 17 4 8 

10 421524 AA312082 Hs 105445 GDNF family receptor alpha 1 4 8 

452827 AI571835 Hs.55468 ESTs 4 8 

414222 AL135173 Hs.878 sorbitol dehydrogenase 4 8 

456086 AL161999 Hs 77324 eukary otic translation termination facto 4 8 

419078 M93119 Hs 89584 insulinoma-associated 1 4 8 

15 418973 AA233056 Hs 191518 ESTs 4.8 

447033 AI357412 Hs.157601 ESTs 4 8 

451621 AI879148 Hs 26770 fatty acid binding protein 7, brain 4.7 

419968 X04430 Hs 93913 interieu kin 6 (interferon, beta 2) 4 7 

424326 NM_014479Hs.1 45296 d sin teg rin protease 4.7 

20 431585 BE242803 Hs 262823 hypothetical protein FL J 10326 4 7 

429294 AA095971 Hs 198793 Homo sapiens cDNA: FU22463 fis, clone H 4 7 

416814 AW192307 Hs80042 dolichyl-P-Glc:Man&GlcNAc2-PP-dolichylgl 4 7 

439897 NM_0153l0Hs6763 KIAA0942 protein 4 7 

429687 AI675749 Hs 211608 nudeoporin 153kD 4 7 

25 422880 AF228704 Hs.121524 glutathione reductase 4.7 
405801 NMJ)00390:Homo sapiens choroideremia (Ra4 6 

432435 BE21B886 Hs 282070 ESTs 4 6 

439544 W26354 Hs 28891 hypothetical protein FU 11 360; artemis p 4 6 

425354 U62027 Hs 155935 complement component 3a receptor 1 4 6 

30 436027 AI864053 Hs 39972 ESTs, Weakly similar to I3858B reverse t 4 6 

424623 AW963062 Hs 337404 ESTs 4 6 

403366 NA Target Exon 4 6 

402542 Target Exon 4 6 

450193 AI916071 Hs 15607 Homo sapiens Fanconi anemia complementat 4 6 

35 411678 AI907114 Hs 71465 squalene epoxtdase 4 6 

456844 AI264155 Hs 152981 CDP-diacylglyceroi synthase (phosphatida 4 6 

448072 AI459306 Hs 24908 ESTs 4 5 

408045 AW138959 Hs245123 ESTs 4 5 

423782 AI472209 Hs 323117 ESTs 4 5 

40 447388 AW630534 Hs 76277 Homo sapiens, clone MGC9381 1 mRMA, comp4.5 

448140 AF1 46761 Hs 20450 BCM-like membrane protein precursor 4 5 

452561 AI692181 Hs 49169 KIAA1 634 protein 4 5 

425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo4.5 

428801 AW277121 Hs 254861 ESTs 45 

45 428500 AI815395 Hs 184641 fatty acid desaturase 2 4 5 

426075 AW513691 Hs270149 ESTs, Weakly similar to 21 09260A B cell 44 

437259 AI377755 Hs 120695 ESTs 4 4 

400409 AF153341 Hs 283954 Homo sapiens winged helix/forkhead trans 4 4 

412863 AA121673 Hs 59757 zinc finger protein 281 4 4 

5 0 426989 AI815206 Hs 99395 ESTs 4 4 

401866 Target Exon 4 4 

418819 AA228776 Hs 191721 ESTs 4 4 

406348 Target Exon 4.4 

412138 AW895387 gb;GV4-NN0038-300300-157-c10 NN0038 Homo4.4 

55 428550 AW297880 Hs 98661 ESTs 4 4 

411743 AW862214 gb:QV4-CT0361-301299-074-b05CT0361 Homo4.4 

429966 BE081342 Hs 283037 HSPC0 39 protein 4 4 

423291 NM^004129Hs 126590 guanylate cyclase 1,50^^613 2 4 4 

423456 AL1 10151 Hs 128797 DKFZP586 D0824 protein 44 

60 452190 H26735 Hs 91668 Homo sapiens clone PP1 498 unknown mRIMA 4 4 

424871 NM_004525Hs 153595 low density lipoprotein related protein 4 3 

429575 AA706003 Hs 99387 ESTs 4 3 

429922 297630 Hs.226117 H1 histone family, member 0 4 3 

421379 Y15221 Hs.103982 small inducible cytokine subfamily B (Cy 4 3 

65 400300 X03363 HER2 receptor tyrosine kinase {c-erb-b2, 4 3 

437258 AL041243 Hs.174104 ESTs 4 3 

446595 T57448 Hs.15467 hypothetical protein a J207 25 4 3 
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403011 NA EN SPQ 000021 5330*: Probable serine/threoni 4.3 

419055 AI365384 Hs.11571 Homo sapiens cDNA FLU 1570 fis, done HE 4.3 

418661 NM_001949Hs.1189 E2F transcription factor 3 4.3 

407786 AA687538 Hs,38972 tetraspan 1 4.3 

5 429183 AB014604 Hs.197955 KIAA0704 protein 4.3 

442914 AW188551 Hs.99519 hypothetic*) protein FLJ 14007 4.3 

441029 AI091795 Hs.179246 ESTs 4.3 

452194 A1694413 Hs, 332649 olfactory receptor, family 2, subfamily 4.3 

414821 M63835 Hs.77424 Fc fragment of IgG, high affinity la, re 4.2 

10 410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ 14035 fis, clone HE 4.2 

452110 T47667 Hs 28005 Homo sapiens cDNA FU 11 309 fis, done PL 4.2 

442007 AA301116 Hs 142838 nucleolar phospboprotetn Nopp34 4.2 

417318 AW953937 Hs.12891 ESTs 4.2 

431818 AW510444 Hs 191705 ESTs, Weakly similar to T471 84 hypotheti 4.2 

15 443646 AI085198 Hs 164226 ESTs 4.2 

419169 AW851980 Hs 262346 ESTs, Weakly similar to S72482 hypotheti 4.2 

446839 BE091926 Hs 16244 mitotic spindle coiled-coil related prot 4.2 

423242 AL039402 Hs 125783 DEME-6 protein 4.2 

432116 AA902953 Hs.308538 ESTs 4.2 

20 409038 T97490 Hs 50002 small inducible cytokine subfamily A (Cy 4.2 

445625 BE246743 Hs.288529 hypothetical protein FU22635 4.2 

425139 AW630488 Hs 325820 protease, serine, 23 4.2 

447397 BE247676 Hs 18442 E-1 enzyme 4.2 

410166 AK001376 Hs 59346 hypothetical protein FU 10514 4 1 

25 437295 AW779318 Hs 88417 ESTs 4 1 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 41 

441790 AW294909 Hs. 132208 ESTs 41 

410129 BE244074 Hs.58831 regulator of Fas-induced a poptosis 41 

427521 AW973352 Hs.290585 ESTs 4 1 

30 425247 NM_005940Hs. 1 55324 matrix metalloproteinase 11 (MMP11;stro 41 

412886 AF041163 Hs. 74647 Human T-ceD receptor active alpha-chain 41 
441153 BE562826 gb:601336534F1 NIH_MGC_44 Homo sapiens c4.1 

444301 AK000136 Hs.10760 asporin (LRR class 1) 41 

426711 AA383471 Hs. 180669 con served gene amplified in osteosarcoma 41 

35 405850 NA Target Exon 41 

440283 AI732892 Hs. 190489 ESTs 4 0 

432441 AW292425 Hs.1 63484 ESTs 4 0 

400284 NA estrogen receptor 1 4 0 

417341 N91453 Hs.102987 ESTs 4 0 

40 429732 U20158 Hs 2488 lymphocyte cytosolic protein 2 (SH2 doma 4 0 

411393 AW797437 Hs 69771 B-factor, properdin 4 0 

425704 U79293 Hs. 159264 Human clone 23948 mRNA sequence 4 0 

419594 AA013051 Hs 91417 topoisomerase (DNA) II binding protein 4 0 

419092 J05581 Hs 89603 mudn 1, transmembrane 4 0 

45 443147 AI034351 Hs 19030 ESTs 40 

408633 AW963372 Hs. 46677 PRO2000 protein 4.0 

433404 T32982 Hs.1 02720 ESTs 4.0 

421506 BE302796 Hs 105097 thymidine kinase 1, soluble 40 

417900 BE250127 Hs 82906 CDC20 (cell division cyde 20, S. cere vi 3.9 

50 414602 AW630088 Hs 76550 Homo sapiens mRNA; cDNA DKFZp564B 1264 (f3. 9 

413762 AW411479 Hs848 FK506-btndhg protein 4 (59kD) 3.9 
404580 NMJ14112^Hc^TlOsapier^st^icho^hinophala 3.9 

452046 AB018345 Hs 27657 KIAA0802 protein 3 9 
459587 AA031956 gb:zkl5e04,s1 Scares_pregnanLuterus_NbH 3 9 

55 416658 U03272 Hs 79432 fibrillin 2 (congenital contractural ana 3 9 

426647 AA243464 Hs 294101 pre-B-cell leukemia transcription factor 3 9 

429353 AL1 17406 Hs 200102 ATP-binding cassette transporter MRP8 3 9 

419038 AW134924 Hs.190325 ESTs 3 9 

413918 X07871 Hs.89476 CD2 antigen (p50), sheep red blcod eel! 3 9 

60 421977 W94197 Hs 110165 ribosomal protein l_26homotog 3 9 

442567 AI201183 Hs.1 30251 ESTs 3 9 

421168 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3 9 
431701 AW935490 Hs.1 4658 Human chromosome 5q 13.1 cfone5G8mRNA3.9 
418526 BE019020 Hs.85838 soiute earner family 16 (monocarboxylic 3 9 

65 414998 NM 002543 Hs. 77729 oxidised low density lipoprotein (lectin 3 9 
422790 AA809875 Hs.25933 ESTs 3 9 

419741 NMJX)7019Hs.93002 ubiquto'n carrier protein E2-C 3.9 



327 



WO 02/059377 



PCT/US02/02242 



430017 AA263172 Hs.35 protein tyrosine phosphatase, non-recept 3.9 

458814 AI498957 Hs,170861 ESTs, Weakly similar to Z195J-HJMAN ZINC 3.8 

428514 AW236861 Hs.193139 ESTs 3.8 

434521 NM_002267Hs,3886 karyopherin alpha 3 (importin alpha 4) 3.8 

5 409425 U40462 Hs,54452 zinc finger protein, subfamily 1A, 1 (Ik 3.8 

439560 BE565647 Hs,74899 hypothetical protein FU1 2820 3 8 

424028 AF055084 Hs.153692 Homo sapiens cDNA FU 14354 fts, done Y7 3 8 

400021 AFFX control - HUMISGF3A/M97935_MA 3 8 

453403 BE466539 Hs.61779 Homo sapiens cDNA FU 13591 fis, clone PL 3 8 

10 445941 AI267371 Hs.172636 ESTs 38 

434378 AA631739 Hs 335440 EST 3 8 

429220 AW207206 Hs 136319 ESTs 3 8 

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 3 8 

401045 C11001883 , :gi|6753278lreqNP_033938.1[c 3 8 

15 430178 AW449612 Hs 152475 ESTs 3 8 

423397 NMJ301838HS 1652 chemokine (C-C motif) receptor 7 3 8 

447630 AI660149 Hs 44865 lymphoid enhancer-binding factor 1 3 8 

436391 AJ227892 Hs 146274 ESTs 3 8 

413011 AW068115 Hs.821 bigh/can 3 8 

20 422121 Af767949 Hs.179833 ESTs 3 8 

452268 NM_003512Hs 28777 H2A histone family, member L 3 8 

427811 M81057 Hs 180884 carboxypeptidase B1 (tissue) 3 8 

415579 AA165232 Hs 222069 ESTs 3 8 

437330 AL353944 Hs50115 Homo sapiens mRN A; cDN A DKFZp761J1 112 (f 3.8 

2 5 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase <c-eri>b2, 3 7 

400286 NA C16000922:gi|7499103|pir||T2u903 hypothe 3 7 

420281 AI623693 Hs 191533 ESTs 37 

419926 AW900992 Hs 93796 DKFZP586D2223 protein 3 7 

417541 AI992191 Hs. 180040 hypothetical protein FU22439 3 7 

30 426172 AA371307 Hs 125056 ESTs 3 7 

429638 AI91 6662 Hs. 211 577 kinectin 1 (kinesin receptor) 3 7 

457001 J03258 Hs2062 vitamin D (1,25- dihydroxyvilamin D3) re 3 7 

424109 AW406878 gb:UI-HF-BLO-adg-g-06-0-Ul.r1 NIH_MGC_373 7 

417022 NM_014737Hs 80905 Ras association (RalGDS/AF-6) domain fam 3 7 

35 436222 AI206737 Hs.122810 HomosapienscDNAFLJ11489fis,cfoneHE 3.7 

430448 AI633553 Hs 13303 Homo sapiens cDNA: FLJ2 1784 fis, clone H 3 7 

432729 AK000292 Hs.278732 hypothetical protein FLJ20285 3 7 

413916 N49813 Hs.75615 apolipoprotein C-il 3 7 

421662 NM_014141Hs.106552 cell recognition molecule Caspr2 3 7 

40 441633 AW958544 Hs. 11 2242 normal mucosa of esophagus specific 1 3 7 

408761 AA057264 Hs.238936 ESTs, Weakly similar to (deffine not a va 3 7 

406153 Target Exon 3 7 

445563 AW873606 Hs.149006 ESTs 3 7 

453464 AI884911 Hs.32989 receptor (calcitonin) activity modifying 3 7 

45 448918 AB011152 Hs.22572 KIAA0580 protein 3 7 

413936 AF113676 Hs 297681 serine (or cysteine) proteinase inhibito 3 6 

448069 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3 6 

453313 BE005771 Hs 153746 hypothetical protein FLJ2 2490 3 6 

425234 AW152225 Hs.165909 ESTs, Weakly similar to 1 3 8022 hypotheti 36 

50 419941 X98654 Hs.93837 phosphatidylinositol transfer protein, m 3 6 

402397 AF188625 Hs.189507 phospholipase A2, group II D 3 6 

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 3 6 

448106 AI800470 Hs.171941 ESTs 3 6 

426431 NM 000458Hs. 169853 transcription factor 2, hepatic; LF-B3; 3 6 

5 5 431843 AA516420 Hs.183526 ESTs, Weakly similar to 1 38 022 hypotheti 3 6 

426878 BE069341 gb:GV3-BT0381-270100-073-c08 BT0381 Homo3.6 

434061 AW024973 Hs.283675 NPD009 protein 3 6 

445292 AV653264 Hs 13982 HomosapienscDNAFLJ14666fis,cloneNT 3 6 

452101 T60298 Hs.10844 Homo sapiens cDMA FL J 14476 fis, clone MA 3 6 

60 427581 NM_0147B8Hs 179703 KIAA01 29 gene product 3 6 

409047 AW961434 Hs 31539 ESTs 3 6 

416820 NM_000402Hs.80206 glucose-6-phosphate dehydrogenase 3 6 

410386 W26187 Hs,3327 Homo sapiens cDNA: FLJ22219 fis, clone H 3 6 

440516 S42303 Hs.161 cadherin 2, type 1, N-cadherin (neuronal 3 6 

6 5 434360 AW015415 Hs.127780 ESTs 36 

428970 BE276891 Hs,194691 retinoic acid induced 3 3 6 

415079 R43179 Hs.22895 hypothetical protein FL J 2 354 8 3.6 
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409619 AK001015 Hs.55220 BCL2-associated athanogene 2 3.6 

430044 AA464510 Hs.152812 ESTs 3.6 

4X829 AW451999 Hs.194024 ESTs 3.5 

434224 AA380731 Hs.84 interleukin 2 receptor, gamma (severe co 3,6 

5 439247 AF088020 Hs.46767 EST 3.6 

431542 H63O10 Hs.5740 ESTs 3.5 

430713 AA351547 Hs.2642 eu ka ry otic translation elongation factor 3.5 

434988 AI418055 Hs.161160 ESTs 3.5 

437748 AF234882 Hs.5814 soppressfonoftumorigenicity? 3.5 

10 418322 AA284166 Hs.84113 cyclin -dependent kinase inhibitor 3 (CDK 3.5 

439569 AW602166 Hs.222399 CEGP1 protein 3.5 
459583 AI907673 gb:IL-BT152-080399404 BT152 Homo sapien3 5 

403212 NM_019595:Homo sapiens intersects 2 (IT 3.5 

409099 AK000725 Hs.50579 hypothetical protein FU2071 8 3.5 
15 453968 AA847843 Hs.62711 Homo sapiens, clone I MAGE: 335 1295, mRNA 3 5 

436338 W92147 Hs,1 18394 ESTs 35 

422890 Z43784 Hs 75893 ankyrin 3, node of Ranvier (ankyrin G) 35 

442295 A1827248 Hs 224398 Homo sapiens cDNA FLJ1 1469 fts, done HE 3.5 

417975 AA641836 Hs 30085 hypothetical protein FLJ231 86 3 5 

20 433730 AK002135 Hs3542 hypothetical protein aj 11 273 3.5 

439926 AW014875 Hs 137007 ESTs 3.5 

445873 AA250970 Hs 251946 poly (A)-bin ding protein, cytoplasmic 1-1 3.4 

410153 BE311926 Hs 15830 hypothetical protein FLJ 12691 3.4 
422128 AW881145 gb: QV0OT0033-01 0400-1 82-a07 OT0033 Homo 3.4 

25 414921 BE390551 Hs 7762B steroidogenic acute regulatory protein r 3.4 

441134 W29092 Hs 7678 cellular retinoic acid-binding protein 1 3.4 

444564 AI167877 Hs.143716 ESTs 3.4 

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 

30 422414 AW875237 Hs.13701 ESTs 3.4 

433345 AI681545 Hs.152982 hypothetical protein FLJ 131 17 3.4 

409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4 

422611 AA158177 Hs.118722 fucosyllransferase 8 (alpha (1,6) fucosy 3 4 

423554 M90516 Hs.1674 glutamine-frudose-6-phosphatetransamin 3.4 

35 442619 AA447492 Hs.201B3 ESTs, Weakly similar to AF1 64793 1 prote 34 
402359 NA C 19001 99 1 *rgi|1 26561 11|gb|AAK00751.1|AF23 4 

439398 AA284267 Hs.221504 ESTs 3 4 

415208 F01020 Hs.172004 titin 3 4 

452853 AA812633 Hs.10845 ESTs 3 4 

40 429345 R11141 Hs.199695 hypothetical protein 3 4 

449027 AJ271216 Hs.22880 dipep tidy I peptidase III 34 

412115 AK001763 Hs.73239 hypothetical protein FU 10901 3 4 

432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1 3.4 

428977 AK001404 Hs.194698 cyclin B2 3.4 

45 431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4 
418286 AA622528 Hs.319825 Homo sapiens, done IMAGES 61 6574, mRNA,3.4 

436895 AF037335 Hs.5338 carbonic a nhydrase XII (tumor antigen H 3.4 

443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 34 

428450 NMJ14791HS.184339 KIAA0 175 gene produd 3 3 

50 449571 AW016812 Hs.200266 ESTs 3 3 

412777 AI335773 Hs.270123 ESTs 33 

420542 NM_000505Hs.1321 coagulation fador XII (Hageman factor) 3.3 

412754 AW1 60375 Hs.74565 amyloid beta (A4) precursor-like protein 3 3 

418327 U70370 Hs.84136 paired-like homeodomain transcription fa 3.3 

55 449065 AI627393 Hs.258998 ESTs, Weakly similar to high mobility gr 33 

425999 AW513051 Hs.332981 ESTs, WeakJy similar to I38022 hypothefi 3.3 

430280 AA361258 Hs.237868 interieukin 7 receptor 33 

407777 AA161071 Hs.71465 squalene epoxidase 3 3 

426516 BE262660 Hs.1 70197 glutamic-oxaioacetic transaminase 2, mil 3.3 

60 414361 AI086138 Hs.204044 ESTs 3.3 

427080 AW068287 Hs.173466 ras-re I ated C3 botulinum toxin substrate 3.3 

426429 X73114 Hs. 169849 myosin-binding protein C, sbw-type 3.3 

446163 AA026880 Hs.25252 prolactin receptor 3.3 

428566 U41763 Hs.184916 clathrin, heavy polypeptide-fike 1 3.3 

65 418641 BE243136 Hs.86947 a disintegrin and metalloproteinase doma 3.3 

436293 AI601188 Hs.120910 ESTs 3.3 
411257 AA628967 Hs.115274 ESTs, Highly similar to I HH.HUMAN INDIAN 3.3 
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430253 
430066 
436469 
437786 
444079 
457183 
431215 
424563 
450828 
408652 
445142 
426761 
439237 
422616 
443247 
406663 
434137 
408877 
439101 
408221 
447519 
404755 
451871 
420319 
430580 
400202 
400222 
425988 
458098 
430589 
431563 
442353 
422309 
419703 
420380 
410853 
454417 
432745 
422032 
415339 
426384 
448030 
418739 
442053 
434747 
427297 
412228 
452304 
453953 
407758 
451149 
430015 
433313 
418334 
450223 
454365 
451128 
417793 
428027 
441197 
424634 
419986 
416714 
449465 
422166 
409079 
423551 



AW270655 Hs 193804 
R43409 Hs 6829 
AW978484 Hs. 93842 
AI0157O9 Hs.172089 
AW408158 Hs 318893 
BE300330 Hs 118725 



AK001514 Hs.236844 hypothetical protein FU 10652 3.3 
AI929659 Hs 237825 signal recognition particle 72 kD 3 3 

AKO01455 Hs 5198 Down syndrome critical region gene 2 3.3 
BE142681 Hs 155573 polymerase (DNA directed), eta 3 3 

H09048 Hs 23606 ESTs 3 3 

H91882 Hs 118569 Dvl-binding protein I DAX (inhibition of 3 3 
AA496078 Hs 121554 Human DNA sequence from clone RP11-218C1 3.3 
AA446932 Hs 151428 ret finger protein 2 33 
ESTs 3 3 

Homo sapiens mRNA for K1AA1644 protein, 3 3 
Homo sapiens cDNA: FU22554 fe, clone H 3 3 
Homo sapiens mRNA; cDNA DKFZp586l2022 (f3.3 
ESTs, Weakly simitar to A47582 B-cell gr 3 3 
selenopbosphate synthetase 2 3 3 

BE614387 Hs 333893 c-Myc target JP01 3 3 

U24683 Hs 302063 immunoglobulin heavy constant mu 3 3 

AA907734 Hs 124895 ESTs 3 3 

AA479033 Hs 130315 ESTs, Weakly similar to A4 7582 B-cell gr 3 3 
C01765 Hs 38750 hypothetical protein FLJ1 1526 3.3 
AA912183 Hs 47447 ESTs 3 3 

U46258 Hs 339665 ESTs 3 3 

NA Target Exon 3.3 

AI821005 Hs 118599 ESTs 3.2 
hypothetical protein 3 2 

immunoglobulin heavy constant gamma 3 (G 3 2 
NMJ)02795*:Homo sapiens protea some (pros3 2 
NMJ)02082*:Homo sapiens G protein-couple 3 2 
Esfs, Weakly simfer to I38022 hypotheti 3 2 
metallothionein 1E (functional) 3 2 

UDP-N-acetyi-alpha-!>galactosamine:polyp 3 2 
ESTs 3.2 
ESTs, Moderately similar to ALU7_HUMAN A 3 2 
solute carrier family 16 (monocarboxylic 3 2 
ESTs 3 2 

ESTs 3.2 
ESTs 32 
trinucleotide repeat containing 9 3 2 



AW406289 Hs 96593 
AA806105 Hs 300697 
NA 
NA 

BE045897 
BE550224 
AJ002744 
AI027643 
BE379594 
U79745 
AI793257 
AA640891 
H04588 
AI244459 
AI821926 

AA476966 Hs 110857 polymerase (RNA) III (DNA directed) poly 3 2 
NM_015156Hs 78398 KIAA0071 protein 3 2 

AI472078 Hs 303662 ESTs 3 2 



Hs 274454 
Hs.74170 
Hs 246315 
Hs 120912 
Hs 49136 
Hs 114924 
Hs 128151 
Hs 102406 
Hs 30469 
Hs 110826 



gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 3 2 



N30714 
AA310964 
R35343 
AA837085 



Hs 325960 
Hs.88012 
Hs 24968 
Hs 220585 



AW292593 Hs 334907 
AW503785 Hs 73792 
AA025386 Hs.61311 
AW408337 Hs 36972 
D50915 
AL047586 



Hs 38365 
Hs.10283 
AW768399 Hs 112157 
W20128 Hs 296039 



Hs 5521 
Hs.241493 



AA319233 
AA418204 
AW966728 Hs 54642 
AL1 18668 

AW405434 Hs 82575 
U22029 Hs 334345 
BE244638 Hs 166 
NM_O03613Hs 151407 
AI345455 Hs 78915 
AF283770 Hs 79630 
NM_004380Hs 23598 
W72424 Hs 112405 
W87707 Hs 82065 
AA327598 Hs 233785 



membrane-spanning 4-domains, subfamily A 3 2 

SHP2 interacting transmembrane adaptor 3 2 
Human DNA sequence from clone RP1 233G163.2 

ESTs 3 2 
Homo sapiens, clone MGC: 17333, mRNA, com3.2 

complement component (3d/Epstetn Bart vi 3 2 

ESTs, Weakly similar to S10590 cysteine 3 2 

CD7 antigen (p41) 3 2 

KIAA0125 gene product 3 2 

RNA binding motif protein 8B 3 2 

ESTs 3 2 

ESTs 3 2 

ESTs 3.2 

natural killer-tumor recognition sequenc 3 2 

methionine adenosyltransferase II, beta 3 2 
gb:DKFZp761IO310_r1 761 (synonym: hamy2)3 2 

small nuclear ribonucleoproteln porypept 3 2 

cytochrome P450, subfamily IIA (phenobar 3 2 

sterol regulatory element binding transc 3 2 

cartilage intermediate layer protein, nu 3 2 

GA-binding protein transcription factor, 3 2 

CD79A antigen (immunogtobullrvassociated 3 2 

CREB binding protein (Rubinstein-Taybi s 3.2 

S100 calcium-binding protein A9 (calgran 3.2 

interieukin 6 signal transducer (gp130, 3.2 

ESTs 3.2 
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453553 AA036849 Hs.61829 Homo sapiens cDKA FL J 12763 fis, clone NT 3.2 
442580 AI733682 Hs.1 30239 ESTs 3.2 
458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9923.2 
425700 AFQ76292 Hs.159251 forkhead box H1 3.2 
5 417124 BE122762 Hs.25338 ESTs 3.2 
407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3.2 
442215 A1703172 Hs.129005 ESTs, Weakly similar to 2109250A B cell 3.1 
430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1 
425317 AW205118 Hs.210546 interieukin 21 receptor 3.1 
10 426095 AI278023 Hs. 89986 ESTs 3.1 
442313 BE388898 Hs,8215 hypothetical protein FLJ 11 307 3.1 
424709 AL1 37589 Hs.1 52149 hypothetical protein DKFZp434K0410 3.1 
429671 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1 
432715 AA247152 Hs.200483 ESTs, Weakly similar to KIAA1 074 protein 3.1 
15 431574 AW572659 Hs.261373 hypothetical protein (1J43401 4. 3 3.1 
436876 AI124756 Hs.5337 isochrate dehydrogenase 2 (NADP), mitoc 3.1 
405017 NA Target Exon 3.1 

433805 AA706910 Hs.1 12742 ESTs 3.1 
437352 AL353957 Hs,284181 hypothetical protein DKFZp434P0531 3.1 
20 430105 X70297 Hs.2540 cholinergic receptor, nicotinic alpha p 3.1 
422083 NM_001141Hs.111256 arachidonate lipoxygenase, second typ 3.1 
413507 BE145360 Hs.190064 ESTs, Weakly similar to I38022 hypotheti 3.1 
415989 AI267700 Hs.317584 ESTs 3.1 
422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1 
25 425548 AA890023 Hs.1906 prolactin receptor 3.1 
422599 BE387202 Hs.1 18638 n on- metastatic cells 1, protein (NM23A) 3.1 
439963 AW247529 Hs.6793 plate let-activating factor acetyfhydrola 3.1 
453883 AI638516 Hs.22630 cofactor required for Sp1 transcriptiona 3.1 
458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protei 3.1 
30 418478 U38945 Hs.1174 cycftn -dependent kinase inhibitor 2A (me 3.1 
400814 NA Target Exon 3.1 

402327 Target Exon 3.1 

416935 AA190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1 

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1 
35 437036 Ai571514 Hs,133022 ESTs 3.1 
449523 NM_000579Hs.54443 chemokine (C-C motif) receptor 5 3.1 
406642 AJ245210 gb.'Homo sapiens mRNA for immunoglobulin 3.1 

406624 AF052762 gb;Homo sapiens clone csneg8-1 immunoglo 3.1 

421924 BE514514 Hs.109606 coronin, actin-binding protein, 1A 3.1 
40 414523 AU076633 Hs.76353 serine (or cysteine) proteinase in hibito 3,1 
416379 N38857 Hs.203933 ESTs 3,1 
422823 D89974 Hs.121102 vanin2 31 
433904 AI399956 Hs.208956 ESTs 3 1 

421904 BE143533 Hs.109309 hypothetical protein RJ20035 3.1 
45 428834 AW899713 Hs.339315 ESTs 3 1 

436043 AW963838 Hs.168830 Homo sapiens cDNA FLJ 12 136 fis, clone MA 31 
452823 AB012124 Hs.30696 transcription factor-like 5 (basic helix 3 ,1 

405381 NA Target Exon I'l 

428746 AW503820 Hs.192861 Spi-B transcription factor (Spi-1/PU.1 r 3.1 
5 0 435147 AL1 33731 Hs.4774 Homo sapiens mRMA; cDNA DKFZp761C1712(f3.1 
425782 U66468 Hs. 159525 cell growth regulatory with EF-handdoma 3.1 
423306 W88562 Hs.108198 ESTs 3.1 
419123 AA234276 Hs.88253 ESTs 3.1 
438581 AW977766 Hs.292133 ESTs, Moderately similar to I78885 senn 3.1 
5 5 417105 X60992 Hs.81226 CD6 antigen 3.0 
428361 NM_015905Hs.183858 transcriptional intermediary factor 1 3.0 
417880 BE241595 Hs.82848 seiectin L (lymphocyte adhesion molecule 3.0 
402606 NM„024626:Homo sapiens hypothetical prot 3.0 

40^451 NM_004496*:Homo sapiens hepatocyte nude 3.0 

60 421878 AA299652 Hs.1 11496 Homo sapiens cDNA FU11643 fis, clone HE 3.0 
409518 BE384836 Hs.3454 KIAA1 821 protein 3.0 
416933 BE561850 Hs.80506 small nuclear hbonucleoprolein polype pt 3.0 
414324 Y14768 Hs.890 lymphotoxmbeta(TNF superfamiry, membe 3.0 
425081 X74794 Hs.154443 minichromosome maintenance defcent (S. 3.0 
65 401519 C15000476*:gii12737279|reflXP_012163.1| 3.0 

411704 AI499220 Hs.71573 hypothetical protein FLJ 10 074 3.0 
428819 AL135623 Hs.193914 KIAA0575 gene product 3.0 
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428423 AU076517 Hs.184276 

413835 AI272727 Hs.249163 

412600 L28824 Hs,74101 

410491 AA465131 Hs.64001 

5 4 33658 L03678 Hs.156110 

427666 AI791495 Hs.180142 

452514 AI904898 

429500 X78565 Hs.289114 

432485 N90866 Hs.276770 

10 437400 AB011542 Hs.5599 

452234 AW084176 Hs.223296 

413269 BE167526 

453216 AL137566 Hs.32405 
400929 

15 445145 AI961702 Hs.147434 

432615 AA557191 Hs.55028 

423279 AW959861 Hs.290943 

429392 AL109712 Hs.296506 

408548 AA055449 Hs.63187 

20 451346 NM_006338Hs.26312 

413109 AW389845 Hs.110855 

401714 NA 

421462 AF016495 Hs.104624 

421750 AK000768 Hs. 107872 

25 453293 AA382267 Hs. 10653 

457085 AA412446 Hs.98138 

438930 AW843633 Hs.306163 



solute earner family 9 (sodium/hydrogen 3 0 
fatty acid hydroxylase 3.0 
spleen tyrosine kinase 3 0 

Homo sapiens clone 25218 mRNA sequence 3 0 
immunoglobulin kappa constant 3 0 

calmodulin-like skin protein 3 0 

gb:RC-BT068-1 30399-085 BT068 Homo sapien3.0 
hexabrachkxi (tenascin C, cytotactin) 3 0 
CDW52 antigen (CAMPATH-1 antigen) 3 0 
EGF-like-domain, multiple 5 3 0 

ESTs, Weakly similar to I38022 hypotheU 3 0 



gb:CM4-HrQ509-08030(M07-g07 HT05O9 Homo 3.0 
Homo sapiens mRNA; cDNA DKFZp586G0321 (f 3.0 
ENSP00000252232*:Sterof regulatory eleme 3 0 



ESTs 30 

ESTs, Weakly similar to I54374 gene NF2 3 0 

ESTs 3 0 

Homo sapiens mRNA full length insert cDN 3 0 

ESTs, Weakty similar to ALUC_HUMAN !!!! 3 0 

glioma amplified on chromosome 1 protein 3.0 

ESTs 3 0 
ENSP00000241802*:CDNA FLJ11007 FIS, CLON 3.0 

aquaporin 9 3 0 

hypothetical protein FLJ20761 3 0 

ESTs 3.0 

ESTs 3.0 

hypothetical protein AL1 1 01 1 5 3.0 
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TABLE 22A 

Table 22A shows the accession numbers for those pkeys lacking unigenelD's for Table 22. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

IS Si fwSSXM AW803466 AWS03402 AW80341 3 AW803268 AW803396 AW 803334 AW803355 

411743 1256098J AW862214 AW859811 AW862215 

412138 1279172.1 AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538 

413269 1356961.1 BE167526 BE167651 BE076401 R24654 

416935 163179 J AA 190712 AA190665 AA252564 

422128 21 1994 J AW881145 AA490718 M85637 AA304575 T06067 AA331991 

423945 233566„1 AA4 10943 AW948953 AA334202 AA 332882 

424109 235506 J AW406878 AW966560 AW966151 AW96S496 AA336174 AA335376 AA335537 

424128 235728.1 AW966163 AA335983 AA336011 AA335668 AA335973 

425331 250199J AW962128 AA355353 AA427363 

426878 273265 J BE069341 AW748403 AL044891 At908240 AA393080 

432745 353673.1 AI821926 AA658826 AA564492 AA635129 AI791191 

441153 51084J BE562826 BE378727 

448212 755099J AI475858 AW969013 

451128 859865J AL1 18668 D78823 A 17621 76 

452514 920172.1 AI904898 AI904849 AI904899 

456207 1 65078.-1 M193450 
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TABLE 22B 

Table 22B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham t. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Nt_pos'rtion 


400814 


8569925 


Minus 


72840-72924,74761-74849 


400929 


7651921 


Minus 


122033-122241,123483-124028 


401045 


8117619 


Plus 


90044-90184,91111-91345 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Plus 


157315-157950 


401645 


7657839 


Minus 


34986-35133 


401714 


6715702 


Plus 


96484-96681 


401866 


8018106 


Plus 


73126-73623 


402327 


7656695 


Minus 


108675-108770,109801-109910 


402359 


9211204 


Minus 


4040341961 


402408 


9796239 


Minus 


110326-110491 


402470 


9797107 


Plus 


195129-195776 


402542 


9801558 


Minus 


67076-67594 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


403011 


6693597 


Minus 


3468-3623 


403212 


7630897 


Minus 


156037-156210 


403329 


8516120 


Plus 


96450-96598 


403366 


8783692 


Minus 


4932349652 


403485 


9966528 


Plus 


2888-3001,3198-3532,3655-4117 


404347 


9838195 


Plus 


74493-74829 


404580 


6539738 


Minus 


240588-241589 


404755 


7706327 


Minus 


53729-53846 


405017 


6532084 


Plus 


35551-35690 


405348 


2914717 


Minus 


43310-43462 


405381 


6006920 


Minus 


7636-8054 


405801 


2924321 


Plus 


63469-63694 


405850 


6164995 


Plus 


13871-14110 


406153 


9929734 


Minus 


12902-13069 


406348 


9255985 


Minus 


71754-71944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for V non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90 th percentile value amongst thel 2 non-mahgnant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4.0 
(i.e. 4-fold down-regulated in tumor vs. normal breast). 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 5 UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

R1; 0 f so* percentile normal body tissue to 75* percentile tumor 



20 Pkey ExAccn Un»genelD UnigeneTitle 



25 



30 



35 



40 



45 



50 



55 



60 



42B722 
428848 
445263 
418935 
407228 
417511 
439285 
412442 
410544 
412047 
422667 
406664 
423201 
422163 
428769 
407049 
412295 
425126 
406791 
447471 
451533 
419407 
411939 
410532 
425707 
416585 
443060 
435265 
422511 
433138 
402195 
429350 
445107 
406643 
410199 
417225 
437569 
436062 
425078 
430327 



U76456 Hs. 190787 tissue inhibitor of metalioproteinase 4 

NMJJ0Q230 Hs. 194236 leptin (murine obesity homclog) 

H57646 Hs,42586 K1AA1560 protein 

T28499 Hs 89485 carbonic anhydrase IV 

M25079 Hs, 1 55376 hemoglobin, beta 

AL049176 Hs82223 chordln-Hke 

AL133916 Hs 172572 hypothetical protein FLJ20093 

A1983730 Hs 26530 serum deprivation response (phosphatidyl 

AI446543 Hs 95511 ESTs 

AA934589 Hs 49696 ESTs 

H25642 Hs 133471 ESTs 

L34041 Hs.9739 grycerol-3-phosphate dehydrogenase 1 (so 

NMJJ00163 Hs.125180 growth hormone receptor 

AF027208 Hs.1 1 2360 prominin (mouse)-like 1 

AW207175 Hs.106771 ESTs 

X72632 NM_021724*:Homo sapiens nuclear receptor 

AW088826 Hs.1 17176 poly(A)-binding protein, nuclear 1 

N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 

AI220684 Hs.272572 hemoglobin, alpha 2 

AF039843 Hs.18676 sprouty (Drosophila) homclog 2 

NM004657 Hs.26530 serum deprivation response (phosphatidyl 

AW410377 Hs,41502 hypothetical protein FLJ21276 

AI365585 Hs.146246 ESTs 

T53088 Hs.155376 hemoglobin, beta 

AF115402 Hs.1 1713 E74-!ike factor 5 (ets domain transcript 
X54162 Hs79386 leiomodin 1 (smooth muscle) 
D78874 Hs.8944 procollagen Oendo peptidase enhancer 2 
AA779958 Hs.185932 ESTs 
AU076442 Hs.117938 collagen, type XVII, alpha 1 
AB029496 Hs.59729 semaphorin sem2 

NM_004497*:Homo sapiens hepatocyte nucle 
AI754634 Hs.131937 ESTs 

AI208121 Hs.147313 ESTs. Weakly similar to I38022 hypothet 
N77976 Hs.272572 hemoglobin, alpha 2 
AW377424 Hs.205126 Homo sapiens cDNiA: FU22667 fis, clone H 
AA815048 Hs.24078 hypothetical protein FU12649 
AA760849 Hs.294052 ESTs 
AK00O027 Hs.98633 ESTs 

NM 002599 Hs.1 54437 phosphodiesterase 2A, cG MP stimulated 
AW973636 Hs.55931 ESTs 

335 



Ratio 

22.4 

17.4 

15.4 

15.0 

14.6 

14.6 

14.3 

13.6 

12.6 

122 

12.0 

12.0 

11.7 

10.8 

10.6 

10.1 

9.8 

9.8 

9.5 

9.5 

9.4 

9.0 

9.0 

8.9 

8.8 

8.7 

8.6 

8.5 

8.4 

8.3 

8.1 

81 

81 

8.0 

8.0 

7.8 

7.5 

7.5 

7.5 

7.4 
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447577 


AI393693 


Hs.183297 


DKF2P566F2124 protein 


7.4 


446039 


AI150491 


Hs.90756 


ESTs 


7.2 


422060 


R20893 


Hs.325823 


ESTs, Moderately similar to ALUS HUMAN A 


7.2 


424455 


AA452006 


Hs.333199 


ESTs 


7.1 


424343 


AW956360 


Hs.4748 


adenylate cyclase activating polypeptide 


7.1 


442792 


AI352340 


Hs 131194 


ESTs 


7.0 


406714 


AI219304 


Hs.283108 


hemoglobin, gamma G 


6.9 


407571 


AI446183 


Hs.9572 


ESTs, Highly similar to CYA5 HUMAN ADENY 


68 


429580 


AA346839 


Hs 209100 


DKFZP434C171 protein 


67 


453500 


AI478427 


Hs.43125 


esophageal cancer related gene 4 protein 


67 


422233 


AB002058 


Hs 113275 


purinerg/c receptor P2X-)ike 1, orphan r 


67 


420205 


AA256395 


Hs 88156 


ESTs 


66 


404368 


NA 




ENSP0O000241075*:TRRAP PROTEIN. 


66 


447261 


NM 006691 


Hs.17917 


extracellular link domain-containing 1 


65 


417090 


AA1 93282 


Hs 85863 


ESTs, Weakly similar to B34612 zinc fing 


65 


410677 


NNL003278 


Hs 65424 


tetranectin (plasminogen-binding protein 


65 


427283 


AL1 19796 


Hs 174185 


edonucleotide pyropnosphatase/phosphodi 


65 


415011 


AW963085 




gb:EST375158 MAGE resequences, MAGH Homo6.4 


412068 


S72043 


Hs 73133 


metalbthionein 3 (growth inhibitory fac 


64 


416253 


BE250659 


Hs 15463 


Homo sapiens, done IMAGE:2959994, mRNA 


64 


435885 


AA701483 


Hs 36341 


ESTs 


6.3 


402779 


NA 




Target Exon 


63 


418138 


AA213626 


Hs. 136204 


EST 


63 


439335 


AA742697 


Hs 62492 


ESTs, Weakly similar to B39066 proline-r 


63 


427019 


AA001732 


Hs 173233 


hypothetical protein aJ 10970 


62 


411478 


BE143068 




gb:MRO-HT0158-030200-003-b09 HT0158 Homo6 2 


452654 


BE004783 




gb:MR2-BN0114-270400-004-e11 BN0114 Homo6 1 


447359 


NM 012093 


Hs 18268 


adenylate kinase 5 


6 1 


414323 


NNL014759 


Hs.334688 


KIAA0273 gene product 


6 1 


441266 


H15968 


Hs.293845 


Homo sapiens, clone fMAGE:3502329, mRNA, 


6 1 


417011 


F08212 


Hs 234898 


ESTs, Weakly similar to 2109260A B cell 


6.0 


400089 


NA 




Eos Control 


60 


433614 


W07475 


Hs 277101 


cytochrome c oxidase subunit IV isoform 


5 9 


440439 


N92818 


Hs647 54 


ESTs, Weakly similar to potential CDS [H 


5.9 


454404 


BE067414 




gb:MR4-BT0355-200100-201-e05 BT0355 Homo 5 9 


436704 


AA062610 


Hs 148050 


EST 


59 


406563 


NA 




Target Exon 


59 


433490 


AW451023 


Hs.65848 


hypothetical protein DKFZp7610132 


59 


419313 


AA843387 


Hs 87279 


ESTs 


59 


409196 


NM-001874 


Hs 334873 


carboxypeptidase M 


58 


410882 


AW809163 




gb:MR4-ST0118-261099-012-a03 ST0118 Homo 5.8 


453469 


AB014533 


Hs 33010 


KIAA0633 protein 


58 


441899 


AI372588 


Hs8022 


TU3A protein 


58 


426210 


AA372052 


Hs 334559 


Homo sapiens cDNA FU 14458 fis, clone HE 


58 


413065 


BE063555 




gb:CM1-BT0283-081199-033-d09 BT0283 HomoS 8 


454192 


AW876813 


Hs.3343 


phosphoglycerate dehydrogenase 


57 


425187 


AW014486 


Hs 22509 


ESTs 


57 


429757 


AW452355 


Hs 256037 


ESTs 


57 


420202 


AID 36557 


Hs.95910 


putative lymphocyte G0/G1 switch gene 


57 


416284 


AI695473 


Hs.298006 


ESTs 


57 


428553 


AA181641 


Hs 184907 


G protein-coupled receptor 1 


56 


404689 


NA 




Target Exon 


5.6 


438887 


R68857 


Hs 265499 


ESTs 


56 


406082 


S47833 


Hs.82927 


adenosine monophosphate deaminase 2 {iso 


56 


449748 


H23963 


Hs.32043 


ESTs 


56 


431048 


R50253 


Hs 249129 


cell death-inducing DFFA-Iike effector a 


5.5 


452205 


C15819 




gb:C15819 Clontech human aorta pofyA mRN 


5.5 


434040 


AW444613 


Hs 288809 


hypothetical protein FU20159 


5.5 


407744 


AB020629 


Hs 38095 


ATP-binding cassette, sub-family A (ABC1 


5.5 


450606 


AI668605 


Hs 50380 


ESTs, Moderately similar to ALU6_HUMAN A 


5.5 


414629 


AA345824 


Hs 76688 


carboxylesterase 1 (monocyte/macropriage 


5.5 


401665 






C1 1000703 ;gi|10048448|ref(NP_06 5258.1 1 g 


5.5 


436107 


T99079 


Hs 191194 


ESTs 


5.5 


444432 


AI161428 


Hs 75916 


splicing factor 3b, subunit 2, 145kD 


5,5 


434715 


BE005346 


Hs 116410 


ESTs 


5.5 


447205 


BE617015 


Hs 11006 


ESTs, Moderately similar to T17372 plasm 


5.5 


408122 


AI432652 


Hs.42824 


hypothetical protein FLJ107 18 


5.5 
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454016 
414913 
459033 
441003 
450637 
442398 
403612 
407102 
410057 
428232 
432769 
431344 
427032 
406305 
437411 
442800 
402054 
432085 
415313 
459159 
427164 
441391 
458959 
402698 
401810 
438879 
414657 
427809 
456063 
451186 
451882 
402583 
431130 
458218 
416083 
455282 
426488 
426156 
407891 
408610 
445967 
434813 
437526 
454775 
409451 
409853 
405062 
446490 
417622 
421978 
440338 
415421 
417574 
409882 
44799B 
445613 
443074 
451324 
432433 
449654 
414519 
457531 
433200 
430782 
427555 
433545 
420334 



AW016806 
R25621 
AA017590 
BE172240 
N49826 
AA994520 
NA 

AA007629 
R66634 
BE272452 
AA620814 
R99530 
AF012023 
BE261320 
AW613948 
AI809481 
NA 

AF212829 
R59638 
AI904646 
AB037721 
BE467930 
AI285901 
NA 
NA 

AA827674 
AA424074 
M26380 
NM_006744 
AW023469 
A1821324 
NA 

NM_006103 
AI435179 
R53467 
BE1 43867 
X03350 
BE244537 
AA486620 
AW026692 
D59597 
AI524307 
AI076012 
BE1 60229 
AF012626 
AW502327 



Hs.233108 ESTs 



Hs.129907 
Hs.126379 
Hs.18602 



Hs.9739 

Hs.268107 

Hs.183109 

Hs.144959 

Hs.272572 

Hs.173274 

Hs.158196 

Hs.194915 

Hs.131227 

Hs.272406 
Hs.6181 

Hs.173871 
Hs.170381 
Hs.181297 



Hs.189073 

Hs.76780 

Hs.180878 

Hs.76461 

Hs.65256 

Hs.1 00445 

Hs.2719 
Hs. 126820 
Hs.269122 

Hs.4 

Hs.1 67382 

Hs.41135 

Hs.224829 

Hs.118821 

Hs.162870 

Hs.121388 

Hs.54472 



gb:yh45f06.r1 Soares placenta Nb2HP Homo 
ESTs 

ESTs, Weakly similar to 138022 hypotheti 
ESTs 



5.5 
5.4 
54 
5.4 
54 



gb:ou42g09.s1 Soares_NFL_T_GBC_S1 Homo s5 4 



Target Exon 

glycerol-3-phosphate dehydrogenase 1 (so 
muftimerin 

monoamine oxidase A 
ESTs 

hemoglobin, alpha 2 

integrin cytoplasmic domain-associated p 
transcriptional adaptor 3 (ADA3, yeast h 
ESTs 
ESTs 

Target Exon 

potassium channel, subfamily K, member 9 
ESTs 

gb'QV-BT065-020399-103BT065Homosapien 5.2 



AK000706 

AW298163 

AJ243662 

R62431 

R35009 

R00348 

AJ243191 

AI768289 

BE550889 

AW341470 

AI783600 

AW014734 

AI989812 

N94587 

AW973716 

AA682722 

AF026263 

AW137094 

AA868510 

AI349351 



Hs.15125 

Hs.82318 

Hs.110196 

Hs.12758 

Hs.24903 

Hs,56874 

Hs.304389 

Hs.1 58491 

Hs. 144907 

Hs.208052 

Hs.157969 

Hs.199850 

Hs.55063 

Hs.13913 

Hs.192725 

Hs.247920 

Hs.97990 

Hs.1 12496 

Hs 118944 



5.3 
5.3 
5.3 
5,3 
5.3 
5.3 
5.3 
53 
5.3 
5.3 
5.3 
5.3 
5.2 



KIAA1300 protein 
ESTs 
ESTs 

ENSPO000O251335*:DJ1003J2.1 {sodium and 

Target Exon 

ESTs 

protein phosphatase 1 , regulatory (inhib 
lipoprotein lipase 

retlnol-binding protein 4, interstitial 
ESTs, Weakly similar to leucine-rich gli 
ESTs 

NMJ21620:Homo sapiens PR domain contain 
HE4; WFDC2; putative ovarian carcinoma m 
ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU S 



5.2 
5.2 
5.2 
5.2 
5.2 
5.2 
5.2 
5.1 
5.1 
5.1 
5.1 
51 
51 
5.1 
5.1 



gb:MR0-HT0164^)70100-O13-h02 HT0164 Homo5.1 



5.1 
5.1 
5.0 
5.0 
5.0 
5.0 



alcohol dehydrogenase 1B (class I), beta 
natriuretic peptide receptor A/guanylate 
endomucin-2 
ESTs 

CGI-62 protein 
ESTs 

ESTs, Weakly similar to MDHCJHUMAN MALAT 5.0 
gb:QVl-HT0413-090200-062-a12 HT0413 Homo5.0 
fragile X mental retardation 2 5 0 

gb:UWF-BROp^ka-a-07-0-Ul.r1 NIH_MGC _5 5.0 

Target Exon 

hypothetical protein FU20699 
WAS protein family, member 3 
NICE-1 protein 
ESTs 
ESTs 

gb:ye69e06.r1 Soares fetal liver spieen 
heat shock 27kD protein family, member 7 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

K1AA1577 protein 
ESTs 

cholnergic receptor, muscarinic 5 
ESTs 
ESTs 

hypothetical protein FU22477 
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5.0 
5.0 
5.0 
5.0 
5.0 
5.0 
5.0 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
4.8 
4.8 
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421795 


X63094 


Hs.283822 


427138 


N77624 


Hs.173717 


413072 


BE063965 




443721 


AW450451 


Hs,266355 


408053 


AW139474 


Hsi46862 


427067 


AA843716 


Hs.177927 


442969 


AI025499 


Hs.132238 


426220 


AI383475 


Hs.171697 


414593 


BE386764 




426893 


AA398716 


Hs.97418 


434046 


AW292618 


Hs.113011 


401590 


NA 




457971 


AW1 34679 


Hs.242849 


427722 


AK000123 


Hs.180479 


443793 


AA045290 


Hs.25930 


407737 


R49187 


Hs.6659 


441955 


AA972327 


Hs.142903 


441499 


AW298235 


Hs. 101689 


447517 


AI382726 


Hs 182434 


403017 






450580 


N40087 


Hs 15248 


404611 


H58589 


Hs 35156 


414831 


M31158 


Hs 77439 


459290 


NMJXJ1546 


Hs 34853 


444341 


AM 4 2027 


Hs 146650 


408614 


AL1 37698 


Hs 46531 


449638 


AW204277 


Hs. 250723 


434418 


AF1 34707 


Hs 278679 


447360 


AI375984 


Hs 167216 


419583 


F00312 




440698 


AI348455 


Hs 147492 


451199 


AI290653 


Hs 124758 


438338 


NM_014861 


Hs6168 


433756 


AW0 15933 


Hs 112654 


423301 


S67580 


Hs 1645 


417237 


H86385 


Hs 81737 


439745 


AL3 89981 


Hs 149219 


424137 


AA335769 


Hs 16262 


449338 


H73444 


Hs394 


434744 


N94835 


Hs 283828 


407402 


AF0353O3 




443510 


NMJ)12190 


Hs 9520 


415754 


AA169114 


Hs 12247 


415986 


Z43619 




457416 


BE 142052 


Hs 62654 


418064 


BE387287 


Hs 83384 


437120 


AI356125 


Hs 157767 


453950 


AA156998 


Hs 211568 


401093 






436935 


AW206494 


Hs 253560 


457974 


AW842353 


Hs 321717 


428222 


AL133112 


Hs 183085 


442705 


AI264634 


Hs 131127 


437409 


AL359599 


Hs.283850 


458494 


AI380906 


Hs.158436 


410490 


H03589 




416069 


R37101 


Hs 20982 


438463 


AA807958 


Hs 314232 


444043 


AI499723 


Hs 135089 


438327 


H87407 


Hs 172944 


457711 


AF147401 


Hs 23917 


400870 






441425 


AA933590 


Hs 28937 


416267 


H45384 




407262 


M12873 




444567 


AV654020 


Hs.184261 


403263 


NA 





Rhesus blood group, D antigen 4.8 

phosphatide acid phosphatase type 2B 4.8 
gb:QV3-BT0296-14O2O0-O85-h01 BT0296Homo 4.8 

ESTs 4 8 

ESTs 4.8 

ESTs 4 7 

ESTs 4 7 

ESTs, Weakly similar to T13924 sdk prate 4 7 
gb:601273249F1 NIH_MGC_20 Homo sapiens c 4 7 

ESTs 4 7 

ESTs 4 7 

Target Exon 4 7 

ESTs 4 7 

hypothetical protein FLJ20116 4 6 

ESTs, Weakly similar to 2109260A B cell 4 6 

ESTs 4 6 

ESTs 4 6 

ESTs 4 6 

ESTs 4 6 

Target Exon 4 6 

ESTs 4 6 

Homo sapiens cDNA FU1 1027 fts, clone PL 4 6 

protein kinase, cAMP -dependent, regulato 4 6 

inhibitor of DNA binding 4, dominant neg 4 6 

ESTs 4 6 
Homo sapiens mRNA; cDNA DKFZp434C1915 (f 4 6 

hypothetical protein MGC2747 4 6 

a disintegrin and metalloproteinase doma 4 6 

ESTs 4 6 
gb:HSBB0D101 STRATAGENE Human skeletal m4.6 

Homo sapiens cDNA FU1 1777 fis, clone HE 4 6 

ESTs 4 6 

KIAA0703 gene product 4 6 

Homo sapiens, clone MGC:9764, mRNA, comp 4 5 

cytochrome P450, subfamily IVA, pofypept 4 5 

palmitoyl-protein thioesterase 2 4 5 

Homo sapiens mRNA full length insert cDN 4 5 

ESTs 45 

adrenomedullin 4 5 

Homo sapiens genomic DNA, chromosome 21q 4 5 

gb:Horno sapiens clone 23943 mRNA sequenc 4 5 

formyftetrahydrofolate dehydrogenase 4 5 

hypothetical protein FLJ 11413 4 5 

gb:HSC1GE121 normalized infant brain cDN 4 5 

kringle-containing transmembrane protein 4 5 

S100 calcium-binding protein, beta (neur 4 4 
ESTs, Weakly similar to HXA2_HUMAN HOME04 4 

eukaryotic translation initiation factor 4 4 

C12000586*:gi|6330167|dbj]BAA86477.1| (A 4 4 

ESTs 4 4 

ESTs, Weakly similar to S22765 heterogen 4 4 
Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4 4 

ESTs 4 4 
Homo sapiens mRNA; cDNA DKFZp547C126 (fr 4 4 

ESTs 44 

gb:yj42d08.ii Soares placenta Nb2HP Homo 4 4 

ESTs 4 4 

ESTs 4 4 

ESTs 4 4 

chorionic gonadotropin, beta polypeptide 4 4 

ESTs 4 3 

C11000905:gi|11692565jgb|AAG39879.1iAF28 4 3 

homeobox protein from AL590526 4 3 

gb:yn99c10.r1 Soares adult brain N2b5HB5 4 3 

gb:Human Ig rearranged H-chain mRNA VDJ4 4 3 

ESTs, Weakly similar to T26686 hypotheti 4 3 

Target Exon 4.3 
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410034 BE067414 

456804 AI421645 Hs.139851 

446427 BE395260 Hs.309438 

416931 D45371 Hs.B0485 

5 421296 NM_002666 Hs.103253 
400973 NA 

452602 AW366194 Hs.55962 

412330 NMJXJ5100 Hs.788 
405016 

10 435104 AI475671 Hs.88607 
406118 

418556 T02850 

429745 AA480818 Hs.221736 

433088 AW451206 Hs.1 15899 

15 444445 AA342329 Hs.1 15920 

453880 AI803166 Ks.28462 

447384 AI377221 Hs.40528 

414541 BE293116 Hs,76392 

444975 AV652165 Hs.182482 

20 403921 NA 

451477 AI798425 Hs.42710 
406344 

416970 AA191201 Hs.35861 

413662 BE155866 Hs.25522 

25 458504 AW070634 Hs, 144794 
404682 NA 

418089 N69913 Hs.6858 
403433 

446532 AW975460 Hs, 143563 

30 414217 AI309298 Hs,279898 

418425 AI871247 Hs,6262 

419589 AW973708 Hs.201925 

457029 AA3977B9 Hs, 161803 

447860 AF193807 Hs 131B35 

35 448988 Y09763 Hs.22785 

440610 AI733098 Hs, 130800 
439590 AF086410 

427240 AA399975 Hs, 274 151 

408932 AW594172 Hs.278513 

40 436112 T77545 Hs.187559 

444382 AI144152 Hs,58246 

456716 AA318060 Hs.135121 

419846 NM_015977 Hs.285681 

410036 R57171 Hs.57975 

45 400545 IMA 
403051 NA 

420139 NMJXJ5357 Hs.95351 

450244 AA007534 Hs.1 25062 

453261 AA034116 Hs.1 18494 

50 440246 W52010 Hs.191379 

414516 AI307802 Hs.1 35560 

438232 AI150595 Hs.122226 

410233 AA082947 

412179 BE270758 Hs.69428 

55 441871 AI306150 Hs.153450 

426411 AK000708 Hs.1 69764 

453692 AL110416 

448640 AW817177 Hs.102558 

417481 AA203261 Hs.21738 

60 412912 AW118878 Hs.1 10835 

454183 AW807116 

426328 AW631296 

435942 R06285 Hs.191215 

417629 T76945 
65 403593 NA 

402690 

418190 R49591 Hs.270425 



gb:MR4-BT0355-200100-201-e05 BT0355 Homo4.3 



caveolin 2 4.3 

EST 4.3 

adipose most abundant gene transcript 1 4.3 

perilipin 4.3 

ENSPOOO00236667*:Murin 5B (Fragment). 4.3 

ESTs 4,3 

A kinase (PRKA) anchor protein (gravin) 4.3 

CY000171 •:gi|9280405|gb|AAF86402.1 |AF245 4.3 

ESTs, Highly similar to F-box protein FB 4.3 

ENSP00000246632:CDNA FU20261 fis, clone 4.3 

gb FB12A9 Fetal brain, Stratagene Homo s 4.3 

ESTs 4.3 

ESTs 4.3 

Homo sapiens cDNA: FU22816 fis. clone K 4.3 

ESTs, Weakly similar to 138022 hypotheti 4.3 

ESTs 4.2 

aldehyde dehydrogenase 1 family, member 4.2 

ESTs, Weakly similar to T00362 hypotheti 4.2 

C5000212*:gil10047237|dbjlBAB13407.1l (A 4.2 

ESTs 4.2 

C500166a.gil11611537|dbilBAB18935.1|(AB 4.2 

DKFZP586E1521 protein 4.2 

KIAA1808 protein 4.2 

ESTs 4.2 

C9001188*:gi|12738842|refjNP_073725.1| p 4.2 

ESTs, Weakfy similar to I78885 serine/th 4.2 

NM J)01622:Homo sapiens alpha-2-HS-gfycop 4.2 

ESTs 4.2 

Homo sapiens cDNA: FLJ23165 fis, clone L 4.2 

hypothetical protein MGC8407 4.2 

Homo sapiens cDNA FU13446 fis, clone PL 4.2 

ESTs 4.2 

Rhesus blood group, B glycoprotein 4.2 

gamma-aminobutyric acid (GABA) A recepto 4.2 

ESTs 4.2 

gbiHomo saptens full length insert cONA 4.2 

ligatin 4.2 

TP53TG3 protein 4.2 

ESTs 4.2 

ESTs 4,2 

hypothetical protein FU2241 5 4.2 

Wllliams-Beuren syndrome chromosome regi 4.2 

calsequestrin 2 (cardiac muscle) 4„ 1 

Target Exon 41 

Target Exon 4,1 

lipase, hormone-sensitive 4.1 

ESTs 41 

ESTs 4,1 

ESTs 4,1 

ESTs, Weakly simitar to T43458 hypotheti 4„ 1 

ESTs 4.1 

gb:zn10g07.s1 Stratagene hNTT neuron (937 4.1 

hypothetical protein MGC3020 4.1 

ESTs, Weakly similar to 1909123A Na glue 4.1 

hypothetical protein aJ20701 4. 1 

gb:DKFZp434K0431_r1 434 (synonym: htes3) 4.1 
Homo sapiens, done MGC.5352, mRNA, oomp 4.1 

ESTs 4.1 

ESTs 4.1 
gb:MR4-ST0062-040100-024-b12ST0062Homo4.1 
gb:hh83c09.y1 NCI_CGAP_GU1 Homo sapiens 4.1 

ESTs 4.1 

gb:yc92c07.r1 Soares infant brain 1NIB H 4.1 

Target Exon 4.0 

Target Exon 4.0 

ESTs 4.0 
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408641 


AW245207 


Hs.5555 


hypothetical protein MGC5347 


4.0 


427899 AA829286 


Hs.332053 


serum amyloid A 1 


4.0 


445975 


AI811536 


Hs. 145734 


ESTs 


4.0 


438831 


BE263273 


Hs.6439 


synapsin II 


4.0 


455578 


BE006350 


Hs.14355 


Homo sapiens cDNA FU13207 fis, done NT 


4.0 


401840 


NA 




Target Exon 


4,0 


413753 


U17760 


Hs.75517 


laminin, beta 3 (ntcetn (125kD) ( kalinin 


4.0 


445030 


AI205925 


Hs.147238 


ESTs, Highly similar to AAC3.HUMAN ALPHA 


4.0 


433873 


AW156913 


Hs. 150478 


ESTs, Weakly similar to A Chain A, Cryst 


4,0 


456736 


AW246217 


Hs.1619 


achaete-scute complex (Drosophila) homol 


4,0 


450112 


BE047734 


Hs.5473 


ESTs, Moderately similar to ALU5_HUMAN A 


4.0 


448906 


AI589567 


Hs.309719 


ESTs 


4.0 
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TABLE 23A 
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Table 23A shows the accession numbers for those pkeys lacking unigenelD's for Table 23. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs These sequences were clustered based on sequence similarity using Clustenng 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



409853 

410034 

410233 

410490 

410882 

411478 

413065 

413072 

414593 

414913 

415011 

415986 

416267 

417574 

417629 

418556 

419583 

426328 

439590 

442398 

452205 

452654 

453692 

454183 



1156226.1 

1170594J 

118656.1 

1205347 J 

1225686.1 

1247073 J 

1347960.1 

1348163.1 

1464909.1 

1506721.1 

151328.1 

1564410 J 

1583547 J 

1687770.1 

1690392.1 

1767866.-1 

186198J 

264901J 

47413.1 

541271.1 

90415.1 

925931.1 

977825.1 

1049636.1 



454404 1170594.1 

454775 1234106.1 

455282 1273020.1 

459159 919998.1 



AW502327 AW502488 AW501829 AW502625 AW502687 

BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

AA082947 AA083036 

H03589 AW750687 AW750688 

AW809163 AW809247 AW809177 AW809190 AW809225 

BE143068 AW849143 AW848705 AW848569 AW848071 AW848475 AW848092 AW848005 
RE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 

ii063965 BEC63968 BE064034 BE064028 BE063874 BE063966 BE063669 BE064C43 BE064033 BE063884 

BE386764 BE387560 

R25621 C03959 C04010 

AW963085 AA1 59005 AW963073 

Z43619 R61274 H12206 R12883 

H45384 H49125H41699 

R00348 R09593 

T76945 R20210R05755 

T02850 

F00312 AA247490 F31427 AA383663 F22045 
AW631296AA375484 
AF086410 W94386 W74609 
AA994520AW393574 
C15819AA024741 AA024742 
BE004783 BE004947 AI911790 

A^l\e aXSothM AW80741 5 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
SSjffiS AW807 1 19 AW807299 AW807241 AW807225 AW807204 AW807345 

KS AW807431 AwS AW807337 AW807406 AW807259 AW807375 AW845890 AW807220 AW807399 
l^^ ^m^^ AW807236 AW807027 AW8071 12 AW807378 AW807202 BE141593 AW8072 16 
AW 071^8 AW8072^ AwS aS«0^ AW8O7OS0 AW807248 AW807404 AW807075 AW807237 AWB0721 2 
AW807 0 AW8071 04 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW8071C8 AW807025 AW807433 

AW807 38 S 201 BEM1590 AW807302 AW807323 AW807380 AW807109 BE141588 ^^^J^^^^^^^Ij^ 
AW807309 BE141614AW845861 AW807396 AW807300 AW807348 AW80731 1 AW807214 AW807132 AW807402 AW807350 
AW 070^ AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 » 

AW807346 AW 07 f 5 BeS AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807 131 

BE141619BE14O943AW8O7420AW807381 BE141621 AW845921 
BEH1629 B(=141R25 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 
BE067414 Bi067958 BE067419 BE067963 AW577127 AW601412 

BE160229 AW819879 AW820179 AW819882 AW819876 AW820169 BE153201 AW993736 BE152911 
BE1 43867 AW935060 AW886684 
AI904646 BE179494 BE179421 
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TABLE 23B 

Table 23B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (G I) numbers. "Dunham I. et al." refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham L et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Nt_position; indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Nt_position 


400545 


9800107 


Minus 


124618-124881 


400870 


9838306 


Minus 


34081-35027 


400973 


7960452 


Minus 


9811^-98253 


401093 


8516137 


Minus 


22335-23166 


401590 


9966320 


Minus 


33547-33649 


401665 


7145001 


Plus 


121591-122537 


401810 


7342191 


Plus 


129063-129476 


401840 


7684597 


Plus 


56283-56439 


402064 


8083691 


Minus 


8288-8806 


402195 


7689778 


Minus 


147901-148884 


402583 


7684486 


Plus 


94883-95003 


402690 


8348058 


Plus 


13368-13998 


402698 


8570304 


Minus 


108641-108903 


402779 


9588555 


Minus 


38173-39210 


403017 


6693623 


Plus 


78630-79367 


403051 


4827080 


Minus 


5269-5411 


403263 


7770677 


Plus 


52431-52737 


403433 


9719611 


Minus 


72225-72437 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403612 


8469060 


Minus 


94723-94859 


403921 


7711590 


Minus 


3297-3536 


404368 


7630956 


Minus 


102053-102199 


404682 


9797231 


Minus 


4097741150 


404689 


7534100 


Plus 


119461-119717 


405016 


6524300 


Plus 


51997-53308 


405062 


7657730 


Plus 


101283-101432 


406118 


9143818 


Plus 


53997-54629 


406344 


9255974 


Plus 


20254-20374,20526-20659,20835-2 1 097 


406563 


7711604 


Plus 


34401-34538 
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TABLE 24: 



Table 24 depicts Seq ED No., UnigenelD, UnigeneTitle, Pkey, Pred.Cell.Loc., and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to 
Table 25. 



nuclear 
nuclear 
cytoplasm 



Pkey: Unique Eos probeset Identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 
Unigene Title: Unigene gene title 

Pred.Cellloc.: Predicted Cellular Localization 
Seq ID No • Sequence Identification Number found in Table 25 

15 " " 

Pkey ExAccn UnigenelD UnigeneTitle 

449746 A1668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 

407276 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 

20 415539 AI733881 Hs.72472 BMP-R1B 

400297 AI127076 Hs.334473 hypothetical protein DKFZp5640 1278 

450375 AA009647 Hs.8850 a diskitegrin and metalloproteinase doma 

1 02457 NM_001394Hs.2359 dual specificity phosphatase 4 

429170 NMJX)1394Hs.2359 dual specificity phosphatase 4 

15 424399 AI905687 Hs.2533 atdehyde dehydrogenase 9 family, member 

422505 AL1 20862 Hs.124165 ESTs „„ 

449765 N92293 Hs. 206832 ESTs, Moderately similar to ALU8_KUMAN A 

425692 D90041 Hs.1 55956 N-acetyltransferase 1 {arylamine Nicety 

426215 AW953419 Hs.155223 stann'ocalcin 2 

30 439840 AW449211 Hs.105445 GDNF family receptor alpha 1 

410102 AW248508 Hs.279727 Homo sapiens cDNA FU14035 fis, clone HE 

429220 AW207206 Hs.136319 ESTs 

416276 U41060 Hs.79136 UV-1 protein, estrogen regulated 

409079 W87707 Hs.82065 interteukin 6 signal transducer (gp1 30, 

35 442818 AK001741 Hs.8739 hypothetical protein FU 10879 

442082 R41823 Hs.7413 ESTs ^ JAej w 

444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothet) 

446163 AA026880 Hs.25252 Homo sapiens cDNA FL J 13603 fis, clone PL 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxylic 

40 442117 AW664964 Hs. 128899 ESTs 

433043 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 

429353 AL117406 Hs 200102 ATP-binding cassette transporter MRP8 

452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 

446733 AA863360 Hs,26O40 ESTs, Weakly similar to fatty acid omega 

45 452747 BE153855 Hs.61460 I g superfamily receptor LN I R 

423242 AL039402 Hs.125783 DEM E-6 protein 

417433 BE270266 Hs.82128 5T4 oncofetal trophoblast glycoprotein 

432201 A1538613 Hs.298241 Transmembrane protease, serine 3 

423961 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 

50 439569 AW602166 Hs.222399 CEGP1 protein 

114480 BE066778 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:polyp 
404561 NM_Q14112*:Homo sapiens trichorhinophala mitochodria 

325372 NA Phase 2 & 3 Exons nuctear 

112287 AB033064 Hs.334806 KIAA 12 38 protein 
5 5 335824 NA ENSP00000249072*:D J222E 13.1 {^TERMINAL 

424735 U31675 Ks.272499 short-chain alcohol dehydrogenase family 
400289 X07820 Hs,2258 matrix metalloproteinase 10 (stromelysin 
427585 D31152 Hs, 179729 collagen, type X, alpha 1 (Schmid metaph 
429925 NM_000786Hs,226213 cytochrome P450, 51 (lanosterol 14-aipha 
60 429441 AJ224172 Hs,204096 I i pophilin B (uteroglobin family member) 
421155 H87879 Hs.102267 lysyl oxidase 
420931 AF044197 Hs.100431 smaH inducible cytokine B subfamily (Cy 
420813 X51501 Hs.99949 prolactin-induced protein 
452744 A1267652 Hs.30504 



Pred.CeHloc. Seq. ID. No. 



ER 



extracellular 



Homo sapiens mRNA; cDNA DKFZp434E062 (fr 

343 



nuclear 



Seq ID U 2 
Seq ID3&4 
Seq ID 5 & 6 
Seq ID 7 & 8 
Seq ID 9 & 10 
Seq ID 11 & 12 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 13 
Seq ID 19 & 20 
Seq ID 21&22 
SeqlD23&24 
Seq ID 25&26 
Seq ID 27 & 28 
Seq ID29&30 
Seq ID 31 &32 
Seq ID33&34 
SeqlD35&36 
Seq ID 37 & 38 
Seq ID39&40 
Seq ID 41&42 
Seq ID43&44 
Seq ID 45 & 46 
Seq ID 47 &48 
Seq ID49&50 
Seq ID 51 &52 
Seq ID53 &54 
Seq ID 55 & 56 
Seq ID 57 & 58 
Seq ID 59 & 60 
Seq ID6U62 
Seq ID 63 & 64 
Seq ID65&66 
Seq ID 67 &68 
Seq ID 69 & 70 
Seq ID7U72 
SeqlD73&74 
Sea ID 75 & 76 
Seq ID 77 & 73 
Seq ID 79 & 80 
Seq ID 81 &82 
Seq ID83&84 
SeqlD85&86 
Seq ID 87 & 88 
Seq ID 89 & 90 
Seq ID 91 & 92 
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420757 X78592 Hs.99915 

424905 NM_002497Hs.1 53704 

429859 NM_OQ7050Hs.225952 

446921 AB012113 Hs.16530 

5 445537 AJ245671 Hs.12844 

428227 AA321649 Hs.2248 

424001 W67883 Hs.137476 

421727 Y13153 Hs.107318 

452838 U65011 Hs.30743 

10 419667 AU0770O5 Hs.92208 

414812 X72755 Hs.77367 

426320 W47595 Hs.169300 

422867 L32137 Hs.1584 

411558 AA102670 Hs.70725 

1 5 417866 AW067903 Hs.82772 

428398 AI249368 Hs.98558 

431958 X63629 Hs.2877 

428722 U76456 Hs.190787 

412970 AB026436 Hs.177534 

20 421379 Y15221 Hs.103982 

415752 BE314524 Hs,78776 

444051 N48373 Hs.10247 

451110 AI955040 Hs.265398 



androgen receptor (dihydrotestosterone r 
NIMA (never in mitosis gene a)-related k 
protein tyrosine phosphatase, receptor t 
small inducible cytokine subfamily A {Cy 
EGF-like-domain, rnuttiple 6 
small inducible cytokine subfamily B (Cy 
paternally expressed 10 
kynurenine 3-monooxygenase (kynurenlne 3 
preferentially expressed antigen in mela 
a disintegrin and metalloproteinase doma 
monokine induced by gamma interferon 
transforming growth factor, beta 2 
cartilage oligomeric matrix protein (pse 
gamma-aminobutyric acid (GABA) A recepto 
collagen, type XI, alpha 1 
ESTs 

cadherin 3, type 1, P-cadherin (placenta 
tissue inhibitor of metalloproteinase 4 
dual specificity phosphatase 10 
small inducible cytokine subfamily B (Cy 
putative transmembrane protein 
activated leucocyte cell adhesion molecu 
ESTs, Weakly similar to transformation-r 
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TABLE 24A 



Table 24A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



10 Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



15 Pkey CAT number Accession 

335824 CH22_3197FG_619_1 1_LINK_E 
325372 c12Jis 
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TABLE 24B 



Table 24B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey; Unique number corresponding to an Eos probeset 

1 0 Ref; Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et at* refers to the publication 

entitled "The DN A sequence of human chromosome 22." Dunham I. etal., Nature (1999)402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLpositlon: Indicates nucleotide positions of predicted exons. 



15 



Pkey Ref Strand NLposition 



404561 9795980 Minus 69039-70100 



20 
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10 



40 



45 



50 



Table 25 

DNA ^ng exon precSrfon afgonthms, such as FGENESH (Salamov and Sotovycv. 2CK0. Genome Res. ,0:516-522) 
Seq ID NO: 1 ENAjeauencj 

Nucleic Acid Accession #: FGENESH predicted ORr- ,„ n „ nI ^ 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

,J ; ^..™-r.™m-rri i~rrr~r TrTTr.CTGCT GATCCTCCTC 60 



ATGG AGCCCT CCTOGCTTCA GGAACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
SmSm TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 120 
1 5 Ir lc r r^CC ATrTOTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 1 80 
1 5 TACCCaSaA iSSSKSSS^T AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
rrn$crrJ£ T^ACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCCAAG 300 
ATTCTCCTGA AAA^iACAAC^A TCCCA A A AGT GCTG TTAGCC ^9^- ^T^^Jj ^^G ATTGTG 4^ 

20 ^^^^^^^^^^^^ ™ 

20 CCC ATC ATCC TG/^C^A^^G C^AGGAACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTiT 540 
rAArATGTC^CCC^SAK^CCCTOTACAGC ATCATGAAGT GTGCCTTCAG CCACCAGGGC 6»0 
^^rl?T^^C^C C(^ACtC\ TACCTGAAAG CAGTGTTCAA CCTrAGCAAA 660 
A °£^^°J^^aa CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
9 S SSE^AhSk ^TAAArTTAAC CAAGAACTTC A TCAGTTCAC AGAGAAAGTA 780 
25 "cStC TCTTAAGGAT AAGCTAAAAC 1 f A G ™^ A A ^f ^ 
CGCTGGGATT TTCTGGACAT ACn™a™^^^?^^SS W 

G aStc?c^™ 

r if ATHTPTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAG1 A 1 140 

a Ar ATATrrr g^ttactcga caaacccatc acctttccag atggacgctc cttacctgca 1200 

GGAATAA^^G TCT^ATCAA TATTTGCGCT ^^^ A ^^^^^^^^^^^^^^^ctAT 1320 
rr-mncTTCT TTA ACCCCTT GAG ATTCTCC AGGG A AAATT CTGAAAAAA1 ACATCCCTA 1 1 JZU 
1 S rS^^AC CATTCTCAGC TGG ATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
3 5 r ^^^AG TC^^C^i^CTCre CTCCGCTTCA AGCTGGCTCC AG ACCACTC A 1440 

accc^cc a^octS Ttcaagtigtcctcaagtcca AGAATGGAAT CCATGTGTTT 1 00 

^a a^ a * A r T^^AAT^ TTAAGTCCTTTCGTATAAGA ATTAATGAGA CAATT1TCCT 1560 

TY-rrrnn-nT ATCTGGTGAA ACCCACAAAA ACACCTG AAA AAACTCAAGC TGACrrCCAC 1 J4U 
^A^gJa ^A^ATTGGTTTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1800 

ItcaamctccXc^^ 
ttattctcag ttatctttcc ccaataata a aaa a 



65 



70 



Seq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 



60 
120 



LpqI/T OFT _M AHPFLLLILL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 
Y^KF^FEWlf KLMEK^T^AV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKILESW 
iS^So SwWuSJv KPGFNISILK IFITMMSESV RMMLNKWEER 1AQNSRLELF 180 

S S XSSqt ^S? ^KCAFSHOC i SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
oa^^mQW^LKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 
Ia^St SSaG^S MSWILYCL AKYPEHQQRC RDEIRELLGD GSSITWEHLS 360 
^P^TTMCK ECLRLYAPVV N1SRLLDKPI TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 
SOTsS NCIGQHFA.I ECKVAVALTL LRFKLAPDHS 480 

60 RPPQPVRQVV LKSKNGIHVF AKKVC 



Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM J)52997 

Coding sequence: 100-4125 (underlined sequences correspond to start and stop codons) 



1 tl 21 31 41 51 

I I I I I I 



CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60 
Sa^CCT OCGGGGACA AGTCCGGAAG CTGGAGAAGATGACAAAGAG GAAGAAGACC .120 

GGCCATGAGG AAGTAGTAAC ATTTCTGGTA G ACAG AAAGT GCCAGCTTG A CGTCCTTC i AT 240 
GGCG AACACA GGACACCTCT GATG A AGGCT CTACAATGCC ^ ATCAGGAGGC L™TGCAA AT 300 
ATTCTGATAG ATTCTGGTGC CG ATATAAAT CTCGTAGA1 G ]^l^^^J^S^ ^ 
7 S CATTATGCTG TTTATAGTG A G ATTTTGTCA GTGG TGGCAA AACTGCTGTC CCATGGTGCA 4 u 
1 ^ OTcTtCGAAG TGCACAACAA GGCTAGCCTC ACACCACTTT TACTATCCAT AACGAAAAGA 480 
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AGTGAGCAAA TTGTGGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGG AT CATCAGAGAT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTGA CGTCTTTGCT GCAGATATAT GTGGAGTAAC TGCAGAACAT 660 
TATGCTGTTA CTTGTGGATT TCATCACATT CATGAACAAA TTATGGAATA TATACGAAAA 720 
5 TTATCTAA A A ATCATCA A A A TACCA ATCCA GAAGGAACAT CTGCAGGAAC ACCTGATGAG 780 
GCTGCACCCT TGGCGGAAAG AACACCTGAC ACAGCTGAAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCrr GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGA AC AGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1 020 

1 0 CCTGCAAA AG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC } 080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGGAAGA TCGCATGGGA GAAAAAAGAA 1200 
ACACCTGTAA AGACTGGATG CGTGGCA AG A GTAACATCTA ATAAAACTAA AGTTTTGG A A 1 260 
AAAGG A AGAT CTAAG ATGAT TGCATGTCCT AC AAAAGAAT CATCTACAAA AGCAAGTGCC 1 320 

1 5 AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1 380 
TCTCGGAGTC TCTTTGAGAG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAGAATGAA 1560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 

20 AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1 680 

AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGGAA AATTAGAAGA G TCTCCTAAT 1740 
A AAGATGGTC TTCTG AAGGC TACCTGCGGA ATG AAAGTTT CTATTCCAAC TAAAGCCTrA 1 800 
GAATTGAAGG ACATGCAAAC TTTCAAAGCG G AGCCTCCGG GGAAGCCATC TGCCTrCGAG 1 860 
CCTGOCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATGAACAA 1920 

25 ACATGGAGAG CAGATGAGAT ACTCCCATCA GAATCCAAAC AAA AGG ACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAAGAAAT AG AT A A A ATA AATGG AAA AT TAGAAGGGTC TCCTGTTAA A 2 100 
GATGGTCTTC TGAAGGCTA A CTGCGG A ATG AAAGTTTCTA TTCCAACTA A AGCCTTAGA A 2 1 60 
TTGATGGACA TGCAAACT1T CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 

3 0 GCCATTGAAA TGCAAAAG TC TGTTCCAAAT AAAGCCTTGG AATTGAAGAA TGAACAAACA 2280 
TTGAGAGCAG ATGAGATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAGAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACA CATC AAA AAGAAATAGA TAAAATAAAT GGAAAATTAG AAGAGTCTCC TGATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAGAATGAAA GTTTCTATTC CAACTAAAGC CTTAGAATTG 2520 

3 5 ATGGACATGC AAACTTTCAA AGCAGAGCCT CCCGAGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCAAATAAA GCCTTGGAAT TGAAGAATG A ACAAACATTG 2640 
AGAGCAGATC AGATGTTCCC TTCAGAATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGG 2700 
GATTCTGAGA GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCrACA 2760 
CATC AAA AAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 

40 ATCTTGGATA CAGTTCATTC TTGTGAAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG GAAAAATGGA ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAG AA ATA A A ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TGAGATTGAC TTT AA A CC AA G AAG AAG AG A AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 

45 GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAGAATT GAAGAGTGTA 3180 
GAAAGTAATT TGAATCAGGT TTCTCACACT CATGAAAATG AAAATTATCT CTTACATGAA 3240 
AATTGCATGT TGA AAAAGGA AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG A A A AG G AAA A TAAATACTTT GAGGACATTA AGATTITAAA AGAAAAGAAT 3360 
GCTGAACTTC AGATGACCCT AAAACTGAAA GAGG AATCAT TAACTAAAAG GGCATCTCAA 3420 

50 TATAGTGGGC AGCTTAAAGT TCTG ATAGCT GAG AACACAA TGCTCACTTC TAAATTGAAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAGAGGCA GAAATTGAAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCA A AG A AAAATGAATG TTGATGTGAG TAGTACGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTGA AGCTC AAAGGAAATC CAAAAGCCTA 3720 

5 5 AAAATTAATC TCAATTATGC AGGAGATGCT CTAAGAGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAAACACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 3900 
TTTCAACTAC AAAGCAAA AA TATGTGGCTT CAACAGCAAT TAGTTCATGC ACATAAGAA A 3960 
GCTGACAACA AAAGCAAGAT AACAATTGAT ATTCATTTTC TTGAGAGGAA AATGCAACAT 4020 

60 CATCTCCTAA AAGAGAAAAA TG AGG AG AT A TTTAATTACA ATAACCATTr AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CATGAGAGAC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TGAAAATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTrCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTGAA 4320 

65 GAATTACTTG TTCACGAATT GCATAAAGCT GCACAGGATT CCCATCTACC CTGATGATGC 4380 
AGCAGACATCATTCA ATCCA ACCAGAATCT CGCTCTGCAC TCCAGCCTAG GTGACAGAGT 4440 
GAG ACTCCAC CTCGGAAA 



70 Seq ID NO: 4 Protein sequence: 

Protein Accession it: NP_443723.1 

1 11 21 31 41 51 

7 5 MTKRKKTINL NIQDAQKRTA LHWACVNGHE EVVTTLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACANILI DSGADfNLVD VYGNMALHYA VYSEILSVVA KLLSHGAVIE VHNKASLTPL 120 
LLSITKRSEQ I VEFLLJKNA NANAVNKYKC TALMLAVCHG SSEIVGMLLQ QNVDVFAADI 1 80 
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CGVTAEFTYAV TCGFHH1HEQ LMEYIRKLSK NHQNTNPEGT S AGTFDEAAP ' LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRK1AWE KKEDTPRE1M SPAKETSEKF TWAAKGRPRK 360 
IAWEKKETPV KTGCVARVTSNKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
5 DEEYSCDSRS LFESSAKIQV CIPESIYQKV MEINREVEEP PKXPSAFKPA ™Q^VPNk 4S0 

AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKWG 540 
KLEESPNKDG LLKATCGMKV S1PTKALELK DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKNEQTWR ADEILPSESK QKDYEENSWD TESLCETV'SQ KDVCLPKAAH [ O^ID^K 660 
LEGSPVKDGL LKANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQ^VPNKAL 720 

1 0 ELKNEQTLRA DEILPSESKQ KDY EESSWDS ESLCETVSQK DVCLPKATHQ ™DKWGK1 ™ 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK PSAPEPAlEM QKSVPNKALh 840 
LKNEQTLRAD QMFPSESKQK K.VEENSWDSE SLRETV'SQKD VCVPKATHQK EMDKISGKi-E 900 
ECTSLSWLD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEIKSQLENQ 960 
KVKWEQeTc^ ^RLTLNQEEE KRRNADILNE KIREELGRIE EQKRKELEVK QQLEQALRIQ 1020 

1 5 DIELKSVESN LNOVSHTHEN ENYLLHENCM LKKFJAMLKL EIATLKHQYQ EKENKYFED1 1080 
ESXeL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSK.LKEKQ DKEILEAEIE 1 340 

QDHDQ^VTSR KSQEPAf H1A GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
OrSkSLKJN LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ '1260 
SkLFQL QSKNMWLQQQ LVHAHKKADN KSKIT1DIHF LERKMQHHLL KEKNEEIFNY 1320 

20 NNHLKNR1YQ YEKEKAETEN S 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 273-1785 (underlined sequences correspond to start and stop CDdons) 

1 11 21 31 41 51 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AG TGCGGAGA CCGCGGCGCT 60 
30 GAGGA^GCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT ^^^ r^^ 120 
fiTGAAAGGAA AGGA AGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 

^TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTT1 ^GCGAAGTCC AGG^TTA 3 0 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCAC^CGTC^GGTC 360 
3 5 TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT £GCA^CA 420 
GACGGATATT GTTTCACGAT GATAG AAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 
GOTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
A^WTC^A TTOAATGCTC CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCT^A ADO 

40 ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGA AACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 
l^C^G^GA^CCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTG CTG GT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTG AA AC AG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 

45 S^TC?S7^C AGAGGAACXX AGCTGGTTCA GAGAGACAGA ^AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG ^^l C c ^^^ ^ 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1 140 
TATCTG A AGT CC ACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 

50 CATCG AGATC TGAA AAGTAA AAACATTCTG GTGAAGAAAA fc ATGGAACTTC ' CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTA AC 1 380 
ACTC^AGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAG A 1440 
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 
OTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 560 

S S SaGTCCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
SctSt7^ TAAGGCAGATGGGAAAACTC 1680 

A?KAAT GCTGGGCTCACAATCCrcCATCAAGGCTGA CAGCCCTGCG f™™" 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 18 00 
CATCTCTGCA GAA AGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGA^TCAAA 1860 

60 TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGOTC CCAGTCGGH C AGA^^ 

CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAG AAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 

65 Qpg m NO: 6 Protein sequence: 

Protein Accession none found 

! 11 21 31 41 51 

70 • MLLRSAGKLN VGTKKEDGES TAPTPRPK.VL RCKCHHHCPE DSVNN1CSTD GYCFTMIEED 60 
DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKKRDFVD 120 
rPlHTlRALLl SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETY1 PPGESLRDLI 180 
^^S^SGS^LPL^VQRTI AKQIQMVKQ1 GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

7^5 SS VLMR^E^LG HAADIKGTG SWTQLYL1TD YHENGSLYDY LKSTTLDAKS 300 
^SIyS^/S GLCHLHTE1F STQGKPAIAH RDUCSCNILV KXNGTCCIAD LGLAVKFISD 360 
^^^ ^G^VmVP EVIDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 
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EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession none found 

Coding sequence: 482 -3 00 7( underlined sequences correspond to start and stop codons) 



AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGClTGGATC 60 
TACAACCTAA GAAAGCAAGA GTGATCAATC TC AGCTCTGT TAAACATC1T GITTACTTAC 1 20 
TGCATTCAGC AGCTTGCAAA TGGTTA ACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 1 80 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 

1 5 CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 

CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTG A GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 

20 CC A A ACTCC A GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTA A AT AACGGCTTGA CGATGCTTCA 720 
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTrGGAT TTAACAATAT 780 
TGCAGATATT GAG ATAGGTG CATTTAATGG CCITGGCCTC CTGAAACAAC TTCATATCAA 840 

25 TCACAATTCT TTAGAAATrC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGG A A AT CAATTACAAA CATTGCCTTA 1 080 
TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

30 GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTT GCCCT ACTCC ACCAG TGTAT GAAGAACATG AGGATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT C A ACTA AG AC 1380 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

3 5 ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1 500 
ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTrATCAG ATCTGAGACC 1560 
TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATrATTCACA GTTTAATG AA 1 620 
GTCTG ATCTA GTGGAATATT TCACTTTGGA A ATGCTTCAC TTGGG AAACA ATCGTATTGA 1 680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT A ACGAG ATrA CAAAAACTCT ATCTAAATGG 1 740 

40 TAACCACCTG A CCA A ATT AA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1 800 
ATATCTTG AA TACA ATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1 860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTITTTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA A AC AA ACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCrGGGA 2040 

45 CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTrCTGATTA TGTTCATCAC 2340 

5 0 TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

AAAGAAACAA GTAGATGAGC AAATGAG AG A CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAG AGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

5 5 TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 
GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 

60 TTTTGAACTT AAAGCTAATT TACA TGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 
AACATAGATG GAGAGTTTGA GGGCTTTCGC AG AAA TGCTG TG ATTCTGTT TT AAGTCC AT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 31 80 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 

65 

Seq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

70 1 11 21 31 41 51 

: I i I I I 

MKI.WIHI.FYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLrNCEA KGIKMVSEtS 60 
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI AD I EI G A FN G LGLLKQLHFN 120 
HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSK1.N RLKVLILNDN AIESLPPNIF 180 
75 RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 

IIGDVVCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLIILAATSS INDSRMSTKT 300 
TSILK.LPTKA PGUPY1TKP STQLPGPYCP IPCNCKVLSP SGLUHCQER NIESLSDLRP 360 
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PPQKPRKLIL AGN1IHSLMK SDLVEYFTLE MLHU3NNRIE VLEEGS ^ r ^^^ G c ^ 
NIILTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK ^^^^Y^™^ 48 ^ 
GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ W1QKLSKNTV 540 
TDDILCTSPG HLD1CKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT ™ ADTILRS . 600 
LTDAVPLSVL ILGLL1MFIT IVFCAAGlVV LVLHRRRRYK KiCQVDEQMRDNSPy^I^QYSM 660 
YGHKTTHHTT ERPSASLYEQ HN1VSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 
LLFOENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNTL EKERELQQLG ITEYLRKNIA 7S0 
QLQ?DNl" Am' PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 
T 

Scq ID NO: 9 DKA sequence 
Nucleic Acid Accession #: NM__003474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons) 

1 U 21 31 41 51 

CACTAACGCT CTTCCTA.GTC CCCGGGCCAA CTCGGACAGT TTGCTCATIT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 
70 CTHTn^AA . AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GOGGGCCGTG CGCGAGGOCT 1 80 
Ca^AGC^ CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 

GCCC^OGTGG G ATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTG AAGGCCG 300 
GCG^GATGG CAGCGCGCCC GCTGCCCGTG tccoccgccc ^gccct^g^cgc^g ^o 
GCOGGTCCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTrATGGAA CG A AGG AAGA 420 

25 Saag ttgtcagtgc ctctgttcgg agtggogacc ^ A ^Jf^^ 4 5 8 40 

TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GG AAACCCAC 600 
TATCTGCAAG ACGGTACTG A TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TCTTACT ACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 
^0 TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 

Iotgcaacc^^^^^^ 

Sa rCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTCTTO: [f^ff^™ 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTG(3 CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAG ATCTGGA AAAA^G 1020 

*\ 5 CAGCGATTAA TAG AGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT G AACATTCGG 1080 
SSr TAGGD3TGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1 1 140 
rPATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
T^TGACA ATG^GCAGCT TGTCAGTGGG GTITATTTCC AAGGGACCAC CATCGGCATG 1260 
GCC^TCA TGAGCATGTG CACGGCAG AC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

40 GACAA^CC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA mOGOATC »380 
\ATCATGACA CACTGGACAG GGGCrGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
tTCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGG AAG 1500 

ACtGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGG AGTGT 1620 
45 Sa^SSSSaGGA ATGTATGAAT OGCTGCTGCA ATGCCACCAC CTGTACCCTG 1630 
AAGCOGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTCCCAGCT GAAGCCTGCA 40 
GGAACAGCGT GCAGGG ACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG ^AGGGGCC 1 800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 860 
mCTACTGCT ACAATGGCAT CTGCCAG ACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
5 0 SaGGTGCTA A ACCTGCCCC TGGG ATCTGC TTTGAG AGAG TCAATTCTGC AGGTG ATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCG AAGAGT TCCTTTGCCA AATGCGAGAT GAG AO ATGCT 2040 
Iaatctggaa AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2 00 
StTTOATAC TaAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
SSKCT TGGGCGATG A CATGCCGGAC CCAGGGCTTG TOCTJ^^^T 2220 
55 GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAG rGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
G aScC CACT GGGCACC TCC CTTCTGTO AC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
G^CCCCATCC GGCAAGC AG A TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TCTCrrCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCHGAT ACGACTG CTG 2520 
60 TTWAAATA AGAAGACC AC CATTGAAAAA CTAAGGTGTG TCCGCOTC CCGGC^CCC 580 
CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATG AGGAAG 2640 
rCGCCAG ATT CCTACCCACC G AAGG ACAAT CCCAGGAG AT TGCTGCAGTG TCAG AATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTG AAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
OTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 
65 AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
rrrGCAG ATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CA A TG G G A G A CTOGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 
TG AAG AC AG A AGT1TGCACT ATCTTTCAGC TCCAGTTGGA GTrTTTTGTA CCAACTTTTA 3120 
70 OG ATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAG CT AC TGCCGTCAGT 31 80 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTT AT 3240 
nCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTrTAG TAlTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
atGGG ATrCT GGACAGG ATG TGTTTGCTTT CTG ATC AAGG CCTTATTGGA AAGCAGTCCC 3420 
75 CCA^CTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
ScAGTTGAT TTTCTGG ATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGG AG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTrCTT TTTCTCGCTG TCCTTrCTAG AGCACTGCCA 3840 
5 CCAGTAGGTT ATTTAGCITG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCITG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTAITCAGAC AGATGAGGAC ITTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

1 0 TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT G AGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

1 5 AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGA A AGCCTTC ITT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGG ITT GCCTTCCAGA AA AC AAA ACT GCATTTCACT TTCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG AT AA ACT AAA CACGTGACAC 4740 

20 AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTA AAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAG AT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 

25 ACCAAAAAAA AAAAAAAAAA AA 



Seq ID NO 1 0 Protein sequenc e: 
Protein Accession if: NP 003465.2 

30 

I 11 21 31 41 51 

I I I 1 I I 

MAARPLPVSP ARALLLALAG ALLAPCEARG VS L WNEG RA D EVVSASVRSG DLWIPVKSFD 60 
SKNHPEVLNI RLQRESKHL I INLFRNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 

35 YHGHVRG YSD SAVSLSTCSG LRGLIVFEKE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 1 80 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
UEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR K1MKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP 1MSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCO MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

40 SFGGQKCGNR FVEEGEEODC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 
AKPAPG [CFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKJQCQGGAS RPVIGTNAVS 600 
IETN1PLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ N1SVFGVHEC 660 
AMQCHGRGVC NNRXNCHCEA flWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVT1LCL 720 

45 LAAGF VVYLK RKTLIRLL FT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAOGTCK PNPPQKPLPA DPLARTTRI>T HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK. 

50 Seq ID NO* 1 1 DNA sequence 

Nucleic Acid Accession tk NM_001394 

Coding sequence: 400 1 584( underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

55 i | i i | | 

GGCACGAGGG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT CCGCGGTCCT 120 
CTCTCGTAA A CACACTC I'CC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 1 80 
TGGGCGCCCG AGGCCGCTCC G ACTGCTATG TGACCGCGAG GCTGCGGGAG GAAGGOGACA 240 

60 GGGAAGA AG A GGCTCTCCCG CGGG AGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTTC 360 
TCTTCCTCGC GCGCCCAGCC GCCTCGGTTC CCGGCGACC A TG GTGACGA1 GGAGGAGCTG 420 
CGGGAGATGG ACTGCAG TGT GCTCAAAAGG CTGATGAACC GGGACGAGAA TGGCGGCGGC 480 
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 

65 CTGGAC TGCA GACCGTT C:CT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GG'ICAACGTG 600 
CGCIGTA ACA CCA ICGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAG ATCCTG 660 
CCCGCCC AGG AGGAGGTACG CGCCCXjCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACGAGC (.CAGCCCv iC:G CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA (XiCCGAGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840 

70 GAGAGGTTTT CCTCCG AGTA CCCAGAATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCC AG CGCCACAGAG CCCTTGGACC TGGGCTGCAG d'CCTG rGGG 960 
ACCCCACTAC ACGACCA<jGG GGGTCCTGTG GAGATCC'ITC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTG ITGAAT 1080 
GTCICCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CA'ICCCAGTG 1 140 

75 GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACI GCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CI CGCGGTCG 1260 
GCCACCATC'I GCC1GGCCTA CCIGA IX5ATG AAGAAACGGG TGAGGCTGGA GGAGGCCTTC 1320 
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65 



1680 



1980 



GAGTTCGTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTrCAT GGG^GCTG 380 
CTGCACTTCG AGTCCCAGGT C3CTCGCCACG TCCTGTGCTG OGG AGGCTCC T ACOTDCTCG 1440 
GGACCCCTGC GGGAGCGGGG CAAG ACCCCC GCCACCCCCA CCrCGCAGTI CG CTrCAGC 500 

rrrccGGTcr ccgtgggcgt gcactcggcc cccagcagcc tgccctaoct gca^gcxcc i seo 

5 ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1 620 
CAGCAAGGGT AGGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGC AG<>GG 16 
CGACCGAGCT CCTTCCCCAT CA'nTCTCCT TGGCCAACG A CGAGGCCAGC CAG AATGGCA1 
ATAAGGACTC CGAATACATA ATAAAAGCAA ACAGAACACT CCAAOTAGA G^TAACCG 1 800 
GTGCCGCAGC AGCCAGGGAA GACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1 860 
1 0 AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTG ATGAAA CCCACAGATC 1920 
CTAGCAAATG TGCCCAACCAGCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAG IS 
AAGACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACTTCT CCCTTTGTTG TCGTTGTTGT 2040 

agtoa^gS ATTTCArrrr taaaagaaat cttcg aaggt GiGGnrrcA titctcagtc 2100 

ACCAACAGAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT dTCAGAGTA 21 60 
1 5 TGAAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCA CC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 

atgSSg Sgcatg atg agcaacitga gtttgttgcc actgagaagc ■ aockx^ 

GGTGGG AGGA GGAAGAAAGG GAAGAATTAG GTTTG AAITG CTTnTAAAA AAAAAAGAAA 2400 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460 
20 TGGGAGGAGG AAGAAAGGGA AGAATTAGGT TTGAATTGCT TTTTTAAAAA AAAA 

Scq ID NO: 12 Protein sequence-. 
Protein Accession ft NP_001 3 85 

25 1 11 21 31 41 51 

MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFl^SAGY 60 
ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVIV YDE RSPRAESLRE 120 
D^TlVVQA LRR^aStDICLLKGGYERF SSEYPEFCSK TICALAAIPPP VPPSATEPLD ISO 
30 LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL G ITALLNVSS DCPNHFEGHY 240 

O^VEDN HKAMSSWFM EAIEYIDAVK DCRGRVLVHC QAGISRSAT1 CLAYLMMKKR 300 
VRLEEAf EFV KQRRSIISPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

35 Seq ID NO: 1 3 DNA sequence 

Nucleic Acid Accession ft none found 

Coding sequence: 68-340( underlined sequences correspond to start and stop codons) 

40 1 11 21 31 41 51 

AGCGCCTTGC CTICrcrrAGGCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTC^GGT 60 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGG A GTTTCCATCT TTCTGGTCTC 120 
TGCCCAG AAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 

45 TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240 
ScSCTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 
ItGGGTOGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGG AATCAGC^GAGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 

50 GCGAGCTAAC AT 



Seq ID NO: 14 Protein sequence; 
Protein Accession ft none found 



1 11 21 31 41 51 



MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 



60 ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 



Seq ID NO: 15 DNA sequence 
Nucleic Acid Accession #: NM_0 16640.2 

Coding sequence: 39-1358(underlined sequences correspond to start and stop cottons) 

i li 21 31 41 5! 

GCTTAAGTTG ACCTCTGGGT CCGGA ATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 60 
70 GGCCTITGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA GACGTCGCGG CG ACCCCCGT CGCGCGGTAC CCGCCG ATTG 1 80 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGG ATCGAG CGCTGGCAGG 240 
PGACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
T^ATG^AGTA CA^TGGTTTAC CCGCAG ACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
7 5 ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 
^ AACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 

ACTGCCTGCT GCAGG AGCAC 1TCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540 
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AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
5 CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 
CAG ACA A ACT TCCATTATTC AAACGGCAGT ATG AAAACCA CATATTTGTT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 
GGGAAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 

1 0 AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTITT 1 140 
CCTITTTCTG CTACCAGCTA AATACTri'GG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 
CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 

1 5 TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AACCTGTTAT TAGATCTCTT 1500 
ACTCTGCTCA AATTCATCAC TGA AAGATTT AATTTTAGTT ACCTTTTGTT G ATTTAAAAA 1 560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 

20 AAAAGAACCC ACTGC 



Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_057724.1 

1 11 21 31 41 51 
1 I I I I I 

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60 
RRIERWQATV HAAESVDEKL R1LTKMQFMK YMVYPOTFAL NADRWYQYFT KTVFLSGLPP 120 

30 PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVIIRYEESE VISLPFLDQL 1 80 

VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240 
IRJSKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEN HIFVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI EVVFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360 
ITDGICYFSFF CYQLNTLALT TQADQNNPRK N1CWCTQSKP LYETIEDNDV K.GFNDDVLLQ 420 

35 IVHFLLNRPfC EEKSQLLEN 

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: NM_025059.1 

Coding sequence: 3-2150 (underlined sequences correspond to start and stop codons) 

40 

1 11 21 31 41 51 
1 I I I 1 I 

GCATGAGCCT GGACTGCACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60 

45 AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAGTTAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT G AATGTGCrC 1 80 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCT1TCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTG A AATGGAGAGC TACAAGGAAA ACAATGCCAG AA AATCATCT CTCCTTACCT 300 
CITTGAGAGA CAGAGTTCAG G A ACT AG A AG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 

50 TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGTAGAGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480 
AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTGCGT GACTGCTTGG 540 
ATCCAGATGA GAGGAATGAC A AG GC A TCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATGAATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660 

5 5 TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCTGGCT TCAGA AGTCA 720 
ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAG AA AGAGAAGCTG AACCAGGACC 780 
TGCTCAGTGC TGTAG A AGCA AA AGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTGAAGA 900 
AAAGCTCTTC TGAGTTGGAG AAGAGTITGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960 

60 AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020 
TGACTGGGTC CACTG AGGAC ACCATTTTGG AGAAGATTCG AGAAATGG AC AGCCGGGAAG 1 080 
AAAGCAGGGA CCGG ATGGTC TCCCAGCTTG AAGCCGAAAT ATCTG AGCTT GTTGAACAGT 1 1 40 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAG AG GGCCCAGAAA GCAGAGAA TA 1 200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 

65 riTTGCGAGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG GATCAGCTTT 1320 
CI CAG AAA AT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CG GCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC CCACAATTTG CAGAGAAAGC TAAAGACACA GAAAGAGAGA CTGGAGAGCA 1500 
AAGAATTACA CATGAGCCTC CT CCGG CAG A AAA! AGCCCA GCTGGAGGAG GAGAAGCAGG 1560 

70 CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AG A 'IT AAA AC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCA AGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG 1860 

75 AAAGGGCCAG AAACA I'GATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT GAACGTGACC AGCCTTGCTC TTCCTGATTA TGAAATCATC AAGTGTCTTG 2040 

354 
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AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100 
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATTGA ACACTGTATC 2 160 
TCTTGAGAGA GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTGATCAGTT TGTGAATATT TTATGCTTTG ATGATATAGT GAGAATGCAT 2280 
5 CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAGATAA ACGTCAGCAT TAAAAAACGC 2340 
CAAAAAAAAA AAAAAAAAGC ATTTTAGGAT CCAGAAG AAT TCCACCAGAT TGCATGAGTT 2400 
AGATTGGGAA ATGGGAGTGG GAGATAATAT TGGG AGGTAT CTATTTTAAG TCAGGGGCTT 2460 
T ACTAGCCG A TTTAGTTCTC ACAATAACCA TGTGG AG AAG CTGTGACATT TTTAATTTAC 2520 
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 

1 0 GACCAGGATG GACAACrCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AGAATCTGTT TCTCCTGAAT CTCTGTG ATG CTGGTGGG AA TTGTTTGCAT AGAGGAAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

1 5 GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTGAG CCAAG ATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GA AACTACAT 3000 
CTCAGAAAAA AAAAAAAAAA A AAA A AAA AA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATGATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGG ATCA 3 1 20 
AACTTACATA GGTCTCAGGT CCTGTAAGAA ACTTGCCTGT TCTAACTGTT GCTACCAG AT 3 1 80 

20 TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240 
AGCTTTATTT GGAGCCCAAC CCTATA AGAT GAAGAAATCC TATATAGTCT TATTCACCAA 3300 
TATATCCAAT ACACCCACAG CAATGGTACC TTTTTAAG AT CAGGATTTTA TTATGAATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420 
CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGGACAA TTTATCTTAT TTCTATCATA 3480 

25 T A AG ATG TAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540 
AAGAAATAG A TCTGGTGCAC CCG AACATTA GGAG AAAATG AAAAATATAC AACCAACCGT 3600 
TCGTG AGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660 
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 
GCTG AGACAC G AAAATTGCT TGAACCTGGG AGGCGG AGGT TGCAATGAGC CGAGATCGCG 3780 

30 CTACTGCACT CCAGCCTGGG CAACAGAGAG AGACCTTGTC TCAAAAAACA ACAACAACAA 3840 
AA AGTCA AAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900 
AG AG ATG AAA CAATCAATTG CAAGGGTCAA AACTAG ATTG GATCTTGGTT TG AATGAAAA 3960 
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT 4020 
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080 

35 AACTAAAAAT CTGAG AATCC ACAGTGCTAC AG AC A ATA AA TGATAAAATG GGAAAAAAAA 4140 
AAAAAAAAAA A 



Seq ID NO: 18 Protein sequence: 
40 Protein Accession #: NP_079335.1 

1 11 21 31 41 51 

MSLDCTSHIA LGAASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
45 SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKJ 120 
RTEITAHAAI KENQELKKKV VELNEKLQKC SFCENEENKKQ VSKNCRKHEE FLTQLRDCLD 1 80 
PDERNDKASD EDLILKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE T1MRI^\SEVN 240 
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDT1LE KIREMDSREE 360 
50 SRDRMVSQLE AQISELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420 
LRDNLMFEKQ KYLKPLDQLS QKMKLDQM AA ELGFDTRLDV VLARTEQLVR LESNAVIENK 480 
TIAHNLQRKL KTQJCERLESK ELHMSLLRQK IAQLEEEKQA RTALVVERDN AHLTIRNLQK 540 
KVERLQKELN TCRDLHTELK AKLADTNELK KTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600 
LMSVKSELDT TEHEAKENKE RARNMIEWT SEMKTLKKSL EEAEKREKQL ADFREVVSQM 660 
5 5 LGLNVTSLAL PDYEIDCCLE RLVHSHQHHF VTCACLKD VT TGQERHPQGH LQLLH 



Seq LD NO: 19 DNA sequence 

Nucleic Acid Accession #: AF071552, NM_000662 

60 Coding sequence: 441-1313 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

65 CTTTGTATAA GGCTCAGCTA AAAGGGAAAT TGAGTGGGTC AGGTACCACG GATACTATAC 60 
ACTCTATTGC ATG ATTCTCC TGCCTACATC AGAAG ACGTT TATAAGCCTA TTTTAAAGG A 1 20 
TACCAGTTGG AATCTCTCTT TTATTA ATCA CCAAG AG AAC CATG AACAAG CTG i iTATCA 1 30 
TTTG ACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAGAAG ACATAATACA 240 
TTTCTCACAG GATTCTGGG A CTATTAACTG AACTTATGTG TGTAAAAGGA ATTCATACAA 300 

70 TG AAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC TTTCGTTTTG 420 
TTTTCCTTGC TTAGGGGATC ATGG ACATTG AAG CATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGA AACAT TAACTG ACAT TCTTCAACAC CAGATCCGAG 540 
CTGTTCCCTT TGAGAACCTT AACATCCATT GTGGGGATGC CATGGACTTA GGCTTAGAGG 600 

75 CCATTITTGA TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660 
rrCTGTACTG GGCTCTG ACC ACTATTGGTT TTGAG ACCAC GATGTTGGGA GGGTATGTTT 720 
ACAGC^CTCC AGCCAAAAAA TACAGCACTG GCATGA1TCA CCTTCTCCTG CAGGTGACCA 780 
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PI GATGGCAG G AACTACATT GTCGATGCTG GGTTTGGACG CTCATACCAG ATGTGGCAGC 840 
CTCTGGAGTT AATTTCTGGG AAGGA'E CAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATG A AG 960 
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCITTACTC 1020 
5 TTAAGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080 
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CCCAGATGGG GTTCACTGTT 1 140 
TGGTGGGCTT CACCCTCACC CATAGG AG AT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTA IT TAG AATAAGG 1320 
1 0 AGTAAAACAA TCrrGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTG A CGACCTATCA 1380 
IGTATCTTCT GTACCCTTAC CTf ATnTGA AGAAAATCCT AGACATCAAA TCATITCACC 1440 
TATAAAAATG TCATCATATA TAATTAAACA GC1 I' I'll AAA GAAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 
TGGAAAITGG TG ATTTATGC TAGAAAGCTT TTAATGTTGG TTTATTGTTG AATTC 

Seq ID NO: 20 Protein sequence: 
Protein Accession #: NPJ)00653.1 



20 1 11 21 31 41 51 

I i I I f I 

MD1EAYLERI GYKKSRNKLD LETLTDILQH QIRAVPFENL MHCGDAMDL GLEAIFDQVV 60 
RRNRGGWCLQ VNHLLYAVALT TIGFE1TMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNYI 1 20 
VDAGFGRSYQ MWQPLELISG KDQPQVPCVF RLTEENGFWY LDQIKREQYI PNEEFLHSDL 1 80 
25 IEDSKYRKIY SFTLKPRTIE DFESMNTYL.Q TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240 
HRRFNYKDNT DUEFKTLSE EEIKKVLKNI FNISLQRKLV PKHGDRFFTI 

Seq ID NO: 21 DNA sequence 
Nucleic Acid Accession #: NM 003714 

30 Coding sequence: 123- 1031 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

f 1 1 1 1 

35 CGGCACXjAGC AAAAAGGAAG AGTGGGAGGA GGAGGGGAAG CGGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120 
CCATGTGTGC CGAGCGGCTG GGCCAGTTCA TGACCCTGGC TITGGTGTTG GCCACCTTTG 1 80 
ACCCGGOGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 
AGCAGAAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAGAT CC AG CACTGT TTGGTCAACG 300 

40 CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCA1TCAT CAAAGACGCC TTGAAA TGTA AGGCCX>lCGC TCTGCGGCAC AGGTl'CGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540 
ACCTC/\AGCA CGACCTGTGC GCG<3CTGCCC AGGAGAACAC CCGGGTGATA GTGGAGATGA 600 

45 'ICCATi rCAA GGACTrGCTG CTGCACGAAC CCTACGTGGA CCTCGTGAAC TTGCTGCTGA 660 

C'CTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTGGGGAAG CCTGTGCTCC ATC1TG AGCT TCrTGCAOTTC GGCCATCCAG AAGCCTCCCA 780 
CGGCGOCCCC CGAGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAA( iCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCG A GGTGCCAAGG 900 

50 GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TA TTCT GATA 1020 
TCCGGAGGTGAAATGAAAGG CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC: ATTCCAAAAC ATTTACCATT AGAGAGGGGG GATGTCACAC GCAGGATTCT 1140 
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGGAGAC 1200 

5 5 CXXTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1260 
GAGCGCGCCC GCGTTATCCT CGTACCITTG TCTTCTTTCC ATCTGTGG AG TCAGTGGGTG 1 320 
TCGGCCGCTC TGTTGTGGGG GAGGTGAACC AGGGAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 

60 AAATAT CGCT TAGAATGC:AG GAGAAGGGTG GAGAGGAGGC AGGGGCCGAG GGGGTGCTTG 1560 
GTGCCAAACT GAAATTCAGT TTC1TGTGTG GGGCCTTGCG GTTCAGAGCT CTTGGCGAGG 1620 
GTGGACiGGAG GAGTGTCATT TCTAT(}TGTA ATTTCTGAGC CA1TGTACTG TCl'GGGCrGG 1680 
GGGGGACACI' GTCCAAGGGA GTGGCCCCTA TGAGTTTATA TTTTAACCAC TGCTTCAAAT 1740 
CTCGATITCA CTTTTTTTAT TTATCCAGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1 800 

65 GCTTTCAAAC AA AAA AAA A A AAAAAAAAAA AAAAAAA 

,St;q ID NO 22 l Yolein sequence 
Prutein Accession it: NPJJ03705 

70 

1 11 21 31 41 51 
I I I I I i 

MCAERLGQFM TLALVLATFD PARGTPATNP PEGPQURSSQ QKGRLSLQNT AE1QHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHG1CMTFLH fJAGKFDAQGK SFIKDALKCK AHALRHRFGC 120 
75 1SRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLL-LHEP YVDLVNLLLT 1 80 
OGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPHPS SRETGRGAKG ERGSKSIIPNA HARGRVGGI.G AQGPSGSSEW EDEQSEYSDI 300 
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Seq ID NO: 23 DNA sequence 
5 Nucleic Acid Accession NM_005264.1 

Coding sequence: 557-1954(underIined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

10 i t t i ( I 

G AATTCCGGC CAG AAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAA AT CGTCCTGGCC 120 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 130 
TTTCCTAGCG CAG ATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 
1 5 CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GCGGGGACCA 540 
20 GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTCG GCCG AAGTGA GCGGCGGAGA CCGCCTGG AT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
C ATGGAGGCC CTGAAGCAG A AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAG AA 840 
25 GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGijAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGG A ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACCACCAGCG TGTCCAACG A TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1 140 
30 CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAG ACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 13S0 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 
35 AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTG A CCGTGTGGCA 1 620 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 
40 TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGG AC 1980 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 2040 
45 GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGG A ACATTTTTTT TTTTCCTTTT 2100 
AAG AAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGG A CAGTTTGTAA CTTGGGCTGT 2220 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 
50 TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTG A TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 

55 ID NO: 24 Protein sequence: 

Protem Accession #: NP_0052 5 5 

1 U 21 31 41 51 

60 MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60 

SLASGLEAKD ECRSAMEALK QKSLYNCRCK RG M KKEKNCL RJYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD IFRVVPFISD VFQQVEHIPK GNNCLDAAK A CNLDDICKKY RSAYrTPCTT 1 80 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 

65 Y1DSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360 
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLRSNVSGN THLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL VVTALSTLLS LTFTS 

Seq ID NO: 25 DNA sequence 
70 Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1576(entire sequence represents open reading frame) 

1 11 21 31 41 51 

75 CTTTTGTTrC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 

TGGAGAACTA GGATGTTTCG GGGGAAGOGC TAAGG ACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 1 80 
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CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
5 TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTG AGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 

1 0 GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CA ri'ATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCG AC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 

1 5 GA ACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTITGAGG CCCCACGCCT 1 080 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1 140 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCG TGCTCTrCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 

20 GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGG AGT GTGCAGTACG 1500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC A TC AC AG CCA CGCAAGCCAT 1560 
CCGAATATTC TCCTAA 

25 

Seq ID NO 26 Protein sequence: 

Protein Accession #: FGENESH predicted 



30 1 11 21 31 41 51 
I I I I I I 

FCFAMPSLV V SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCK1K ALRAKTNTYI KTPVRGEEPV 1 80 

35 FMVTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRVVG 240 
LVVGPKGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREFIET HIAVRTGKIL 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VPSGFEAPRL 360 
GEQGGDFG YG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATWWrVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPF LAG LPRRPPG EPLQGFSKt-G GGGLRSPGGG RDCMVCFESE 480 

40 VTAALVPCGH NLFCM ECAVR ICFRTDPECP VCHITATQAI RIFS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1 -2070 (underlined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 

I I I I ! I 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCl'CT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 

50 CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAG'ITCCTG 180 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA ICTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

5 5 CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGG AAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCrCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

60 ATGCTGGGGG CCCAGGGGAT A1GGACACAC TCCATCCAGG GATCCCrrCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960 
IGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TrCCCAGGGA 1020 

65 GACATGG AGA AGGGGGTTGA GCiGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGCTGGGGACGCT 11 40 
GACAGGACAC GGGAAGAGGC CATGCITTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CA\CGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

70 AGGCTGA AGG AGGGCTCCTC AC GGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTG T GC'GCTCTCCT GCAGACAGCC TCTCCATGTC AAGC1TCCAG 1440 
TCTGTCAAGT CCATCTCTAA TI CAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCITCAACA AGCAAGA'ITC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

75 GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 
AGGCAG ATGG GGGCGGGGGC ACACCCCCCA ATG ATCCTGC CCCTTCCCCT GCGAAAGCCC 1 740 
ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 
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10 



25 



30 



ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC C^GGCAGCC 1860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 920 
AAG AGCCTCT CCAAG AAATG CCTG AGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCOGCA 1 930 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 



Seq ID NO: 28 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 II 21 31 41 51 



MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QOQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 
1 5 GGTODGEPLQ TVLAHLAALA PVCQPSG YRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 1 SO 
GPEV1AGROV ATGCSPDLPP PSRAEMGRKP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGN1AAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREE ANILS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
20 GARWVCINGV WVEPGGPSPA RLJCEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKS^SNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
E^ASNAGA^MGNSQHQG RQMGAGAHFP NULPLPLRKPTTLRQCEVU RELWNWLLQ 600 
TQELRHUCSL LEGSQRPQAA PEEASFPRDQ EATflFPKVST KSLSKJCCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 



Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM012319.2 

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 



CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCG A AG AC A 60 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 
3 5 G CGG AGACG A AGGCGCAATG GCGAGG AAGT TATCTGTAAT CTTGATCCTG ACCTTTCCCC 180 
TCTCTCTrCAC AAATCCCCTTCATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 
AAATTAGTCC G AATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC AC AC G GCA AT 300 
ATCATCTACA ACAGCTTTFC TACCGCTATG GAG AAAATAA TTCT1TGTCA GTTGAAGGGT 360 
TCAG AAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 
40 ACC ACGACCA TC ACT C AG AC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 
AG^ATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
^OTOT ^ATAAO CGAAAAGCTC TTTG CCC AGA CC ^"^^^^^ ^ pp^^S^^^^^Q 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG G AGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGGAATCT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 
45 TCTCTGAAGG AACTCACTTT CTAGAGACAA TAG AOACTCC AAGACCTGGA AAACTCTTCC 780 
SaTCT AAGCAGCTGC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTCAGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCm ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCAT CCAG GTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGTCCAGCCATCA 1020 
50 TCAACCAAAT TG ATGCT AG A TCTTGTCTG A TTCATACAAG TGAAAAGAAG GCTGAAATCC 080 
CTCCAAAGAC CTATTCATTA C AAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 
TCAGTTTCCT 3TCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT .260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 
5 5 CAATG^TOAA A AG AGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAOAAGAAA !380 
GTGCCTATTr TG ATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATITCATGT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AA AAAGAATC 1500 
A^3AAGAAACC TGAAAATGATGATG ATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 
CTCAACTITC AACAAATGAG GAG AAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1 620 
60 GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA ^^f™™™f™*f r \ 6 *° 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA^TCAATATOTACCCAGAG ,740 
GGTGCAAGAA TAAATGCCAT TCAC ATTTCC ACG ATACACT CGGCCAGTCA G ACGATCTCA 1 800 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGO AGCTGAA AGATGCCGGC GTCGCCACTT 1 920 
65 TGGCCTGGATGGTGATAATG GGTG ATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 
GTGCTGCTTT TACTG A AGGC TTATCA AGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATG AGTTGCC TCATG AATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2!00 
AGCAGGCTGT CCTTTATAAT OCATTGTCAG CCATGCTGOC GTATCTTGGA ATGGCAACAG 21 60 
GAATTTTCATTGGTCATTAT GCTGAA AATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 
70 GCTTATTCAT GT ATGTTGCT CTGGTTG ATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 
GtgTcCATGG ATGTAGCCGCTGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGG AAT TATGTT ACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAG AGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 
Ig^GATCA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 
7 5 TTCTATOA A TA^G^I'GTC TGTTAC AA AG TCAGTTAAAG GTACGTTTTA ATATTT AAGT 2580 
tTttCTWCT TGGAG ATA AA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

; atttggcatg acatgttctg tatgtttcag ggaaaaatot ctttaatgct 2700 
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"TTrrCAAGAA CTAACACAGT TATTCCTATA CrGGATTTTA GGTCrCTGAA GAACTGCTGG 2760 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAG AAATT TAGAATTAAG TATAAAAAGG 2940 
5 CAG AATTAGT ATAGAGTACA TTCATTAAAC AT1TTTGTCA GGATrATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 
AAATATATTT AATG AATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3 120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
1 0 CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT AC ACTA A GT A 3300 
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

15 Seq ID NO: 30 Protein sequence: 

Protein Accession U: NP_036451.2 

1 11 21 31 41 51 

20 MARKLSV1LI LTFALSVTNP LHELKAAAFP QTTEKJSPNW ESGENVDLAI STRQYHLQQL 60 

FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 1 80 
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIfNQIDA 300 

25 RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAIIPQEVYNE YVPRGCKNKC 540 
HSHFHDTLGQ SDDLIHUHHD YHIHLHHHHH QNH1IPHSHSQ RYSREELKDA GVATLAWMVI 600 

30 MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML USIFEHKIV FRJNF 

Seq ID NO: 3 1 DNA sequence 
35 Nucleic Acid Accession #: NM_002184.1 

Coding sequence: 256-3 012(underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
1A I I I I I 1 

40 GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AGACGCGG AG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAA A AA GCATG AC ATT 1 80 
TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 
A A ATATCCGC GCAAGATGTT G ACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

45 CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGG A AAAATGTATG 420 
GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTG A ACAGAATGTT 600 

50 TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 

GTGAACGAGG GGAAGAAAAT GAGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAGAG A A TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTrGAT 900 

5 5 CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGG AA 960 
CTGTCTAGTA TCTTA AAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAG AT TCCTCCTGAA 1080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAG ACC TTAAACCTTT TACAGAATAT 1 1 40 
GTGTTTAGGA TTCGCrGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 

60 GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCITTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATTTACA AA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1 440 
TATCTAGCAA CCCTAACAGT AAGAA ATCTT GTTGGCAAAT CAG ATGCAGC TGTTTTAACT 1 500 

65 ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTT AAAGC ATTCCCCAAA 1560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGA1 GGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 

70 CAAGCTCCAC CTTCCAAAGG ACCTACTG IT CGGACAAA AA AAGTAGGGAA AAACGAAGCT 1860 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 
GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

75 GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAA ITAAAA aacacatctg GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 
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TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATG AC A A AAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
A AGG AA A AAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
5 ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCC.AA 25S0 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 
CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAGAACT GCAG TCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGG TC AAAG 2760 
CAAGTTTCA T CAGTCAATGA GGAAG ATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2820 
1 0 ATTTCACAAT CCTGTGG ATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATITG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAG T GA AGGACTAG TAGTTCCTGC TACAAC1TCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGAT TTTATCTGTG AATTC 

15 

Seq ID NO: 32 Protein sequence: 
Protein Accession U: NPJ>02 1 75. 1 

1 11 21 31 41 51 

20 | | | | I | 

MLTLQTWVVQ ALFIFLTTES TGELLDPCGY ISPESPVVQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT HNRTASSVT FTDIASLN1Q LTCN 1LTFGQ LfiQNVYGITI 120 
ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEWATKKFA DCKAKRDTPT 130 
SCTVDYSTVY FVNIEVWVE A FNALGKVTSD HINFDPVYKV KPNPPHNLSV nMSEELSSIL 240 

25 KLTWTNPSIK SV1ILKYN1Q YRTKDASTWS QrPPEDTAST RSSFTVQDLX PFTEYVFRJR 300 

CMKEDGKG YW SDWSEEASGI TYEDRPSKAP SFWYK1DPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYLITVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 

30 QLPVDVQNGF IRNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IWPVCLAFL LTTLLGVLFC FNKRDLIKKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD V SVVEIEAND KXPFPEDLKS LDLFKKEKIN 720 
TEGHSSG1GG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TVVHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD FFV'DGGDGILP RQQYFKQNCS QUESSPDISH FERSKQVSSV 840 

3 5 NEEDFVRLKQ Q1SDHISQSC GSGQMKMFQE VSA ADAFGPG TEGQVBRFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 33 DNA sequence 
Nucleic Acid Accession #: NM_01 8255. 1 

40 Coding sequence: 1 1-2491 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
] I 1 I 1 I 

AGTTGGCGAC ATGGTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTrTGCT GCCCAAACCG 60 

45 GGTGCGGGGA GTCCTGAACT GG AGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 1 20 
CTGCTCCGTG GTGCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 180 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTAAACAG GATGGCTCCC C1TCTACTGA 240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTTC AAGGCCATG A AGGACCTGTT TATGCGGTGC ATGCTGTrTA 360 

50 CCAGAGGAGG AC ATC AG ATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGAAAGTG CTTTCTCTCT GTGGACATGA GGATTGGATT AGAGGAGTGG AATGGGCAGC 660 

5 5 CTTTGGTAG A GATCTTTTCC TAGCAAGCTG TTCACAAG AT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGGATGAC GATAACATAA GACTGAAAGA 780 
AAATACTTTT ACCATAGAAA ATGAAAGTGT TAAAATAGCA TTTGCTGTTA CTCTGGAGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTGAGATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 

60 TCTCTGGGCT CC AG ATG A AG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT 1 080 
TGCTCATGCT TTCCACGGAG CGTTGCACCT TTGGAAACAG AATACAGTTA ACCCAAGAGA 1 140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 

65 ATGGAAGAGA AAAGACCAAT CACAGGTG AC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCA CC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTIC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGG AT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1 560 

70 GCCTTCTGAT GAAGAGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGGATCAT CTTCTGCAGA ATACTTTGTG 1680 
GCCTGA AGTT CAAAA ACTAT ATGGGCACGG TTATGAAATA TTTTGTGTTA CTTGTAACAG 1 740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTA A GGCAGCTAAG AAAG AGCATG CAGCTATCAT 1 800 
TCTTTGGAAC ACTACATCTT GG AAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1 860 

75 CACGCAGATG GCCTTCTCAC CTAATGAG AA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGCCrrC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
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GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT CCTCAGTCCT 2160 
GGAOGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GC AGTAGG AT TGG AGTGTGG AA AGATTTGC TTATATACCT GGAAAAAGAC 2280 
5 TG ATCAAGTT CCAGAA ATA A ATG ACTGG AC CCACTGTGTA GAAACAAGTC A AAGCCA AAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAGAA 2400 
GG AAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTGA AGATC ACACTGTG AA 2460 
GATACACAGA GTCAATAAAT GTGCACTG TA A TGG 

10 Scg ID NO: 34 Protein sequence: 

Protein Accession U; NP_060725. 1 

1 11 21 31 41 51 

11)111 

1 5 MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPLKRVV VTNLNGHTAR 60 

VNC1QWICKQ DGSPSTFLVS GGSDNQVIHW EIEDNQLLKA VHLQGHEGPV YAVHAVYQRR 1 20 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 180 
GNDDCRIHIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CLIRIWKLYI 240 
KSTSLETQDD DNIRLKENTF TIENESVKIA FAVTLETVLA GHENWVNAVH WQPVFYXDGV 300 

20 LQQPVRLLSA SMDKTMILWA PDEESG VWLE QVRVGEVGGN TLGFYDCQFN EDGSMIIAHA 360 
FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFHTVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE IARPQIHGYD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAFrGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYGHGYEI FCVTCNSSKT LLASACKAAK KEHAAIILWN 600 

25 TTSWKQVQNL VFHSLTVTQM AFSPNEK.FLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNKITSVHSR IIWSCDWSPD SKYFFTGSRD KKVVVWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHPSQRYVV AVGLECGKJC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AIRKLCWKNC SGKTEQ1CEAE GAEWLHFASC GEDHTVKIHR VNKCAL 

30 Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession U: NM 022131 

Coding sequence: 1 1 -2878 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

35 | | ! I 1 | 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCG AG ACTTC ATATCATGGA GTCATAACTG AG AACAATGA 1 80 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 

40 AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC OCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAG A TCTATGACAG 540 

45 CATTCTGCAG GTGGAGGCCA TTGACG AGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGG A A 660 
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 

5 0 GCCCCTGTTC CCCAGCATCC ACCTGG AG AC GTGCG ATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTIX3T GACCGGGAGA CCTACTCTGA 960 
G AAATCCCTT CAGA AGTTAT GTGG AGCCTC CTCTGGCATC ATTG ACCTCT TGCCATCCCC 1 020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAG A TG ATCTTC AA 1 080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1 140 

55 TCAGTTCACC ATCACCATGT GGATG AAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGC A ACT CAGACAAAAC CG A A ATG A AC CGG CATC ACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1 320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGG AGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1 440 

60 ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 
CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG C AAA ATGG A A AGCCAGAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 

65 CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CA ACGGCGGG 1 800 
TGTGCGGCGC CTCAAAGTAT CCTCC AAAGT CCAGTGCTTT GGGGA AGACG TATGCATCAG 1 860 
TATCCCTGAG GTAG ATGCCT ATGTG ATGGT CCTCC AGGCC ATCG AGCCCC GG ATCACCCT 1920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCOCT GATATCAAGA TTGTGAGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 

70 GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCA ACGAC ACCTGG ATG C CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATG AGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTG AGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 

75 CTACACTAGC AA'I GAGTTCA ACITGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 
TAAGG AGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 25SO 
CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTGAG A TGGACTGGGA 2640 
CGATTCTGCG CTG ACT ATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 270O 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
5 TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAG AATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACTAGTG 2380 
CCC^GGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 30OO 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 

1 0 AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA G ACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGT TGAACT TCACACACGT AAGGTCTTAG 3360 

1 5 TGCTT \ \CAG TTTAAAGG AA AGTCCTTGTT G AGGCAG AAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 34S0 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 

20 CCAGGAGGCA TTCCACACAG GCACTGCCCC AGG ACAACAC A ACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 

25 AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCrGC TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCA AATATAT CTGAAA ACCT AATTrCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTGTTTCTAT AGTAACTGGT 4200 
CTGTGATCTr TTGTGGCCAA GAGAATAGCA GGCAAGA ATT AGGGCCTTGA CAGAATTTCC 4260 

30 ACG AAGCTCT GAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAGA AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 

3 5 CTTGTGAAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGGACCT 4620 

CACAACAAAT CCTGTGAAGT AACTGAG ACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTCG 

Seq ID NO: 36 Protein sequence: 
40 Protein Accession U: NP„071 4 1 4. 1 

1 11 21 31 41 51 

MLPg'rLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG VITENNDTVI 60 
45 LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEA VVLNKT SGEGRLRAKS PIDCELQKEY 120 
TFIIQAYDCG AGPHETAWKK SHKAWHIQV KDVNEFAPTF KEPAYKAVVT EGKJYDSILQ 180 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
50 RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEfCETl LCNSDKTEMN RHFTYALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DICEWHYYVIN VEFPWTLYM DGATYEPYLV 480 
TNDWPIHPSH 1AMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 
CKEGLDINSL ESLGQGDCYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
LKVSSKVQCF GEDVCISIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARGVTLFP 660 
5 5 DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL V 1GGDLDPRQ ECLELNHSEL 720 

HORHLDATNS TAGYS1YGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 
NEFNLEVSH HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSVVP SIATVVinS 840 
VCMLVFWAM G VYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 



60 



Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 143-874 (underlined sequences correspond to start and stop codons) 



65 1 11 21 31 41 51 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG *0 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 1 20 
CGCTGCCCGG CAGCCGGGAG CCATGOGACC CC AGGGCCCC GCCGCCTCCC CGCAGCGGCT 1 80 

70 CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAG AC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AA AGG AG AAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTrGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

75 ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
A ^ATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTA AAATGCA GAA ATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTG AATGT TCAGGACCTC TTCCCATTGA 660 
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AGCTATAATT TATTTGGACC AAGGA AGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGGAAGGAC TTTGTG AAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT S40 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 
5 TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1 080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TITTAAAAAA ATATAAAAGC TACCAATCTT 1200 
1 0 TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1 260 
CAACCTTAAA AAAAAAAAAA AAA A 

Seq ID NO: 38 Protein sequence: 
^ ^ Protein Accession #: none found 

1 11 21 31 41 51 
till I I 

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRJX3FKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL ! 20 
20 GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIE Ail YLDQ 1 80 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VArWVGTCSD YPKGDASTGW NSVSRIIIEE 240 
LPK 

Seq ID NO: 39 DNA sequence 
25 Nucleic Acid Accession*: NM_000949 

Coding sequence: 285-2 1 53(underlincd sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 I I f t I 

3 0 GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CrCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 1 20 
ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 1 80 
TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 

3 5 CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG A ATGGACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTT AAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGG AGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GG AGGAC AT A CATCATGATG GTCAATGCCA 600 

40 CTAACCAGAT GGGAAGCAGT TTCTCGG ATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 7S0 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTITAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 

45 TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 
TACCTAGTG A CTTCACCATG AATG ATACAA CCOTGTGGAT CTCTGTGGCT GTCCTTTCTG 1 020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TA AAAGGATT TGATGCTCAT CTGTTGGAGA 1 140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 

5 0 ACTATGAGGA CTTGCTGGTG GAGTATTTAG A AG TAG ATG A TAGTGAGGAC CAGCATCTAA 1 260 
TGTCAGTCCA TICAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTG AC A GCCCTTCCCT TTTGTCTGA A AAGTGTG AGG 1 380 
AACCCCAGGC CA ATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAG AAG CCAGAGAATC 1 440 
CTGAAACAAC CCAC ACCTGG G ACCCCCAGT GCATAAGCAT GGAAGGCAA A ATCCCCTATT 1 500 

55 TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 
OCAGATCCTC TTACC AC AAT ATTACTG ATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1 620 
CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1 680 
AGTCTAGAGA AG AGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAG A 1 740 
CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGG AGAAAAC CCCCTTTGGC TCCGCTAAAC 1 800 

60 CCTTGG ATTA TGTGG AG ATT CACA AGGTCA ACA AAGATGG TGCATTATCA TTGCTACCAA 1 8 60 

AACAGAGAGA GAACAGCGGC A AGCCC A AG A AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GA AG AATCAG CCAAAGAGGC CCCACCATCA CTTGAACAG A 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2 1 00 

65 TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 
ACTAATGGAA TG ATTGGTTA A A ATG TG ATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AA TGCTCAAG AATGTAGTCA GACTG AC ACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
ITTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTrCCT 2340 
AACTGTGATT TGTAGATTTA CTTTTTGCTG 'ITAGTTATAA AACTATGTGT TC AATG AAAT 2400 

70 AAAAGCACAC TGCTTAGTAT TCTTGAGGG A CAATGCCAAT AGGTATATCC TCTGGAAA AG 2460 
GCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAG A 2520 
A AG ATG ACA A AAG AAAATTT TCCACATAGG AAA ATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTT ACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATIT 2640 
AAAGAAAGAA TGCATTCCAG A AC AAATTGT TTACATAAGT TCCTATACCT TACTG ACACA 2700 

75 TTGCTGATAT GCAAGTAAGA AAT 



Seq ID NO: 40 Protein sequence: 
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Protein Accession U: NP_OOO940.1 
1 11 21 31 41 51 

1 ! I 1 I 1 

MKEKVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETKTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELA VEVKQP EDRKPYLWIK WSPPTUDLK TGWFTLLYEI RLKPEKAAEW 1 80 
HHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 
VAVLSAVICL UVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKiPYFHAGG SKCSTWPLPQ 420 
PSQHNPR5SY HN1TDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS G VMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession #: none found, Eos cloned sequence 

Coding sequence: 1-1572 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTGA AGTGCCTGAT 60 
GGAGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 120 
ATCATCAAGA CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TGAATCCAAT 180 
AGCAGGATCT CATGG ATA AT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TGAGCA ATCG TTTCGGACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGGATACTGC TTTAGTTTTC TCCCAACTGT AACCATCCTA 420 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCGAAAATAG TCATCCAGGA AAATCGGAAA 660 
GAAGCGCAGT ATATGCTTGA AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
GGAGTAGAAC TAACTACCTC ACCTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAGACTTCTC CATTTTGAAA GAGAAAAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAGAA 1020 
GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTC G TAA GATTTAC 1080 
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1 140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAGAGC ATAGCAGGAC TGGCTGGACC GCCCCTTGCA 1320 
GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
GAGCCGGTATGA 

Scq ID NO: 42 Protein sequence: 

Protein Accession U: none found, Eos cloned sequence 
I 1 1 21 31 41 51 

I I ! ! I 1 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IUCTFGVFFN DLMDSFNESN 60 
SRJSWIISIC VFVLTFSAPL ATVLSNRFGH RLWMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERJGWR 180 
YSLLFVGLLQ LNIVIFGALL RPFFIRGPAS PKJVIQENRK EAQYMLENEK TRTSIDSIDS 240 
GVELTTSPKM VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREP1RKIY 360 
IEUCVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTH1PLLAED DVVGIEKMSS 420 
AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKW SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucieic Acid Accession U: FGENESH predicted ORE 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

ATGCTGTCTG GCTTCTTGATG AGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 
GGAAAGAA AC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCG AGG GCGAGGCAGC 1 20 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 1 80 
GGGGAGGCGG AGAAGGGG AA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
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TCCGGACG AC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
5 CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGOCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 

1 0 GCGCGCCTGG ACCAGGGCGG CTGCG ACAAT G ACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1 080 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1 1 40 
CGCGG ATTCC CATCTTCTCC ACGCGGCG GC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1 200 

1 5 ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGG ATCCCCA GCAGAGCCGA 1 320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1 380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1 440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1 500 

20 AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1 740 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1 800 

ZD GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCnTGGATG 1 860 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATITCT GGATATGTTT 1920 
CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
A AG AATTTTA TAAACTGATA AATTAAGG AT TTTT ATT ATG TTGTTATTAT TATTTCTTTT 2 1 00 

-3 0 T TGTTGT TG A CTGC ACAGGA TC AAA ATGCC TGTT ATCTCC CTTTTACTGG GACTTnTTT 2 1 60 
T1T1 1 1 1 IT I TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCA ACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 

3 5 TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAA AAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 

40 TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

43 GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 
A AA AATTTTC TTGTCTTAGT TATAAAAATT CAA AGA AATG TGTTACAAAG ATACITAGTA 3 120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
G ATTGTTA AG AGAAAAGCTT TTCAACGAAG GATTGCCTTT C TTCTCCCAC ACTGTTCTTG 3300 

5 0 ArrTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC T ATCGATCAT CTTTCTAITC 3540 
TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 

5 5 TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTT ATG ATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



Seq ID NO: 44 Protein sequence: 

Protein Accession #: FGENESH predicted 

60 

1 11 21 31 41 51 

I I ! I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGG AQDP RLRPGRSRGR VRLPVKPPEA 120 

65 SGRQPRGPSD CIPRf PSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG ] 80 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRKG RGPSGEYCHG WLDAQGVWRJ GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLTVGS VFVAFIILGS 420 

70 LVAACCCRCL RPKQDPQQSR APGGNRLMET 1PM1PSASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

Seq ID NO: 45 DNA sequence 
75 Nucleic Acid Accession #: NM_002285 

Coding sequence: 55-3738 (underlined sequences correspond to sLirt and stop codons) 
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1 II 21 31 41 51 
I I I I I 

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCGACGC TGACACCTCC CACCATGGAC 60 
AGCTTCGACT TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATG AACCA 1 20 
5 GATAGAAATG CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180 
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
GATGAACTCT CCAACCGGAT CC AG A AC ACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA GAACCAATCA G AGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTOCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 

1 0 TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTA AA 480 
AGAGGCACTA TGGGCTGGCA G AAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600 
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCAAGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 

1 5 AAACCCAGCC TGGTCCAGCA GAAACCGACC GCGTATGTGA GGCCAATGGA CGGCCAAGAT 780 
CAGGCCCCTG ATGAGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
GAAG AAATAA TCCGGGAGAT GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 

20 AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1080 

CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1140 
ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 

25 AGCTCCTCCA GCGACTCAG A GAGCAGCTCC GGATCTGACT CGGAGACCGA GAGCAGCTCC 1380 
AGCGAGAGTG AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA ATG AAAGCCA CGGGTCAG AG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1 560 
GACGTCCAGG ACTGTGGGAA AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAGAAGGAG 1620 

30 ATCAAGAGCA CTTGCAAGGA GGAGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GOGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 

35 GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 
TCCTCCGTGA CCrGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 
TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AGAGGCTGAA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 

40 GTAGGCTCCA TCAACGCCAG G ACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280 
TACACACTGG TCCCCTTrGG CCGGAACGAA CTTCTCTCCC CTCTAAAGGA CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
CCCCAGGAGC CAGGGG TATT GAGCGCCCCT GCCACCAAGG ACTCTGAGAG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAG AAAAG GCTTTGCCAA AATCCA AG AG G AAACGCAAG 2520 

45 TGTGACAACG AAGACGACTA CAGGGAGATC AAGAAGTCCC AGGGAGAGAA AGACAGCTCT 2580 
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAA A A AT GAAAAAATGC TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAA ATACACC AGCGAGGACT TAACTTCTTC CAGCCG ACCT 2760 
AATGGCAACA GTTTG'ITTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 

50 CTGCAGCCTC ACGGCGGAGA CCTCACGAAA GCAGCTCACA ACAATTCTGA AAACATTCCC 2880 
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3 120 

55 ATGG AATCCA AATCTCCTTA TTACCTGATG TATTCAGAAA CAGTAGAGCT CATCAGGTAT 31 80 
GCTATGAGAC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGG AC 3300 
CACGCTGTAA AGTA1TCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
CAAGCCCCAT CTCCGTGGG G G GCC AGTGG A AAG AGCACTG G AACCCCATC CCCCATTTCT 3420 

60 CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACTGGGAGATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGAATT CTTCAACGAC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCC AGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 

65 AGCGCCCACC TGTCATAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAGACTG 3840 
CGGTGAACAT TTCCTCA 

SeqIDNO: 46 Protein sequence: 
70 Protein Accession U: NP 002276 

1 11 21 31 41 51 

i 1 i I I I 

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 
75 KGDELSNRIQ NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 
QPSS1CSTTT STPAAVPVQQ SKRGTMGWQK AGHPPSDGQQ RATQQGSLRT LLGOGVGRQQ 
PRAKQVCNV'E VGLQTQERPP AMAAXHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMOG 
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QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK. LSKFSIPKQG EESRSGETNS 300 
CVEEIIREMT WLPPLSAIQA PGKVEPTKf P FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAVVQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
PILIQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEIKSTCKEE QRPRTANKAP 540 
GSKGVKQKSP PAAVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVVV PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRJ VP 660 
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720 
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVK1 DLTLLSRIPE 780 
HLPQEPGVLS APATTCDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EIKKSQGEKD 840 
SSSRLATSTS NTLSANHCNM NINSVAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960 
RDCKRQKXVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETVELI RYAMRUCTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 
RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1 140 
SALSPSTIVS IPQRIHQMAA NHVSITNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMhHLV QYSQQGLHWL RNSAHLS 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession ft: NM 033151 

Coding sequence: 351 -4499(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
I I I I I I 

ACTGGGATAA AGCAAGAAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGGAGATT 60 
GCCCAGGATC AAGGG TGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 
TAAGG AGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GG AACTGAAA AIGACTAGGA 360 
AG AGG ACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 
GTCAGCAAGA G AG A AATCCT G AGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCrGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAG AA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGC1TCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GAcrcTGcrr tgcccttttt'ctctccgaat GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGA TCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCrCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGG AGAGG CC ATCAGCTTCT 1 140 
TCACCGGTGA TGTAAACTAC CTGTITGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTTTCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATI7TGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGA1TAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACGAATT 1680 
CCAAGTCTGC AGTG ATGAGG TTCAAGAAGT TTTTCCTCCA GG AGAGCCCT GTTTTCTATG 1 740 
TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCC AAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGG TGCT CCACTGCTGC TCCCIGAATC GGGACCTGG A ACTTCTGCCC TTTGGAGACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 

tggacgccca cgtggggaag CACArrmG aggagtgcat taagaagaca ctcaggggga 2400 

AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGG AC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAG A A AG TC A GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CITGAG'ITGG AGGGTCIACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTI'G CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTrAACG ATCrTCAGCT 2820 
TCTGGTGGCr GAGCTACrGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCFGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCIGCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTITU AACFGCn CG 31 20 
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CAGGGG\CTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3 1 80 
TGTCCTTAAT GGTGATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATA ATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAG AAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
5 ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGA AA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTGAT GCGCAGA ATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTG CCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 

1 0 AGTTCACGGC TGTAG AGAGG ATACTGCAGT ACATG AAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 37S0 
\GGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGA AGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGG AGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 

1 5 TGGACATTTG CAGCATCGGC CTGGAGG ACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
VTCCAGTGCT GCTCTCAGGA ACCATCAG AT TCAACCTAGA TCCCTTTGAC CGTCACACTG 40S0 
MXAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAG AT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 

20 CCACAGCCTC CATTG ACATG GAGACAG ACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGG AAGGTGG TAGAATTTGA TCGGCCGG AG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCC ACAGC CACTTCTTCA CTGAGAT_AAG 4500 
GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 

25 TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGG A TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACrTTGTAA AATATAAAAC TAAGGAAAAC 4860 
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Seq ID NO: 48 Protein sequence: 
Protein Accession U: NP J 491 63.2 

35 MTRKRTYWV PNSSG G L VMRG IDIG DDM V SG L IYKTYTLQDG PWSQQE RNPEA PGRAA V P PW GK YD A AL RT 
MIPFRPKPRFPAJQPLDNAGLFSYLTVSWLTPUyIIQSLRSRLDENTIPPI^^ 

VSRRGIEKASVLLVM LRFQRTRLIFDALLGICFCLASVLGPI LIIPK1LEYSEEQLGNVVHGVGLCFALF 
LSECVKSLSFSSSWIINQRTAIRFRAAVSSFAFEKLIQraSVlHn 

VUTCASLVICSISSYFIIGYTAFIAJLCYLLVFPLAVFMTRMAVK.^QHHTSa'SrXJRIRVTSEVLTCIK 

40 LIKMYTOEKPFAKITEDLRJ^ERKLI^KCGLVQSLTSITLFIIFTVATAVW^ 

L ASU^LRLSVFFVPI A VKGL TNSKi> A VMRFKJCFFLQE SPV F YVQTLQDPSKAL VF EEAT L SWQQTCPG I 
VNGALELERNGHASEGMTRPRDALGPEEEGNSLGPELIIKINLWSKGMMLGVCGNTGSGKSSLLSAILEE 
MHLLEGSVGVOGSI^YVPQQAWIVSGNIRENILMGGAYDKARYLQVLHCCSLNRDLELLPFGDMTEIGER 
G LN l^GG QKQRISI^RA VY SDRQIY LLDD PL S A VD AHVG Kl nFEECDCKTLRGKT V V LVTHQLQ YL EF C G 

45 Ol 1L LENGKICENGTHS ELM QKKG KY AQ L 1QKM HK£ AT SDM LQDT AKI AEKP KV ESQ A LATSLEESLNGN 
AVPEHGLTQEEEMEEGSI^WRVYIfflYIQAAGGYMVSClIFFFWLlVFLTlFSFWW 
RT^bKJTMADLGNlADNPQLSFYQLVYGLNALLLlCVGVCSSGIFTKVTRKASTALHNKLJ 
FDTIPIGRLI^CTAGDLEQLDQLLnFSEQFLVL^LMVIAVLUVSVI^PYILl>IGAIIMVlCFIYYMMF 

KKAIGVIttRLENYSRSPLJSfflLNSLQGLSSIHVYGKTC^ 
50 HMTNLVTLAVALFVAFGISSTPYSFKVMAVMVLQ 

EAPLHMEGTSCPQGWPQHGEIffQDYHMKYRDNTPTVLHGn^LTlRGHEVVGIVGRTGSG 
LVEFMAGRJLlDGVDlCSIGLEDLRSKLSVIPQDPVLLSGTIRr^UDPFDRHTTXJQIWDALERTFLTKAI 
SKFPKKLHTDVVENGGNFSVGERQLI^IARAVLRNSKIILIDEATASIDMETDTUQRTIREAFG^CTVL 
V1AHRVTTVLNCDH1LVMGNGKVVEFDRPEVLRKKPGSLFAALMATATSSLR 
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Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession U: NM_0334I9 

Coding sequence: 1 8-980 (underlined sequences correspond to start and stop codons) 



60 



60 1 11 21 31 41 51 

CGAGCCAGGG AGAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 
CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCGAAGA GCAG AACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 1 80 

65 TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACG ACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 

70 TTTTCCACAC CAGGGACACT G ACCTC ACAG AGAAAATGG A CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTG AC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGC TGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 

7 5 AGTGCGTGGT GGTGGTCTTG CTGCTGC AGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 
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ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCA CC CCCTGTTGGC 1 140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA CTCGAGAGTG 1200 
5 GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 

1 0 ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1 560 
GGAGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 

1 5 GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA I860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 
AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 

20 TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG GAGAAAGCTT 2280 
ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 

25 AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAG A AACTTTCCAT 2460 
CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 
TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TTTTGATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 

30 

Seq ID NO: 50 Protein sequence: 
Protein Accession #: NP_2 19487.1 

1 11 21 31 41 51 

35 | | i i i i 

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHG1 LRDQAA GIGKEF1IPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 180 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 

40 

Seq ID NO: 51 DNA sequence 

Nucleic Acid Accession U- XM 059098. 1 

Coding sequence: 1 78-61 8(underlined sequences correspond to start and stop codons) 

45 

I II 21 31 41 51 

I 1 I I ! I 

GATGTACACT CTGAAGTGAG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 

50 GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTGAGATG 1 80 
TCGTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTGA TTCCTGCAGT CCCGTCCATT 240 
TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGG A A A A ACCCAAAG 360 
GTCTTTGACC CCTTGAGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 

55 TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480 
AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTG AAG 600 
AAACTCTCTG AATGTTAGAT CTCAGGGTAC AATGATTAA A CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 

60 GAGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCAT TTCTA G GTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AGACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AGAATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1020 

65 TGCCCTGACC AATCCTACTG CTTTTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080 
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTGAT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence: 
70 Protein Accession U; XP 059098.1 

1 11 21 3t 41 51 

I 1 I I I : 

MSYTTMCIKE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITVVLSIWGL IIHNPA VWKNP 60 
75 KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN C1GQEFAMIE LKVT1AULL HFRVTPDPTR 120 
PLTFPNHFIL KPKNGMYLHL KKLSEC 



370 



WO U2/05W77 



PCT/LS02/02242 



10 



Seq ID NO. 53 DNA sequence 
Nucleic Acid Accession #: NM.030916 

Coding sequence: 1-1533 { underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

ATr^rrCTGT CCCTGGGAGC CGAGATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
T^rrc AT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
Zr^TCCTCC TGG^CC a£ ?A CC^AAAACTG CCCTGCTTCT ACCG AGGGG A CTCCGGCGAG 1 80 
cS^^^^TC^^CTG GACGCGGGCG AAGGCGCCCA GGAACTACCG 240 
^^rCKCTCCAAA^CGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 

C A^rAfiGGCG ACTAC^GTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
C^C^^CQ^^^^CICCCCTG CCCTCACTGA ATCCTGOTCC AGCACTAGAA 480 

1 S r ^^^CGC^A^OCCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 

1 ^JC^CCTCGG A^CACGG AGGT CAAAGGC ACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 

?ct£K T^^^ OTCCACTTC GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTGACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGG ACC AAAGGATCAC CCACATCCTC 720 
rACrTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAGACCAAAA TCTGTGGCAC 780 

?0 A^rJrlrAG AAGGAGCTAT ^GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCTCATAC 840 
Sa GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 

^^CCCC^A^TG^CAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
T^CT^CAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGCG 1020 
Ya^^G^^G^I^XCYCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG !O80 

25 T^G^OTGCTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1140 

VFr A^A^CC ■ AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTCCA^CCC ^A^ACG^ COTAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
^^^ACC ^T^^AGG\CAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
r ACf^VCCCA GTTACTCCAC GCTGACCACG GTGAGGGAGATAGAAACACA GACTGAACTG 1380 

^0 ctgtctS 

30 ^Sa^C^^TCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC !500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TCA 

Seq ID NO: 54 Protein sequence: 
35 Protein Accession ft: NPJ12178.1 



1 n 21 31 41 51 



MPI SI GAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TVVLGQDAKL PCFYRGDSGE 60 
40 OVGOVAWA^V d^K^^ LLHSItYGLHV SPAYEGRVEQ PPPPRNPEDC SVLLRNAVQA 120 
Sy^ PSLNPGPALE EGQGLTLAAS CTAEGSP^S 180 

v^nTCVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240 
H IS RGLeSwH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
^p PI ™ s GiYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360 

45 F^WY^ 

EGWDSUCDN SSCSVMSEEP EGRSYSTLTT VRE1ETQTEL LSPGSGRAEE EEDQDEGIKQ 480 

AMNHFVQENG TLRAKPTGNG IYfNGRGHLV 

Seq ID NO: 55 DNA sequence 
50 Nucleic Acid Accession U: AF007170.I 

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons) 

1 U 21 31 41 51 

S S a ACGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
^ rrrCACCAGJ GCATGA^CCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 1 

£ctaot?I agcccagaac caaggaaagc atgtaccact CACTGACATA tgccaccatc 1 80 

CTCC A^IGC AG^CATGAT GACCTTTGAC CCTCAGG ACA TCCTGCTTGC CGGCAACATG 240 
Jrr AACcI^G CA^GATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 
*0 ^C^CaPcC ?TC^ aSScCACGCTO GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
6 ^rCTCTGCT ATGCAG AGTG CCTGCTGC AG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 

Sg^ 

rArnnfiACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCAC 1 M ju 
r ^ ACC ^AG TG^GOTGO TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 
M Arr ATC^GA ^GOTGTTOA CTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
6 r a r rrAGG aSgSaGOTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 

^c^cca^^c^ttcgig ctcggtactg ggaacgtcaa wtcgaggag o 

^rAriAr/^ CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTc a4u 

^rr^CM^ AACATTGATG CAGCCATCCG GCGTTTCGAG 900 

70 T^c^Ag AgSScAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTO 960 
70 Irrr^Cl TCAOTAC^GGGCCAGTGG AAGATGTCCT ACTTCTACGC CG ACCTGCTC 1020 
ATGTGGTGCT T CACCJACAA c TAC ATTTACA TGAAGGCCGC CTACCTCAGC 1080 

IS GTG ™tS rr^r^Arrr CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
75 CGGAAGTCCC ^CTACTT ^^J^. ATrGGG AAGC AGCCGAA ACT CACGGATGGG 1 320 
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TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1 560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
5 TCCATGGAGT CAAGGACACA CTTTCG AATC CAGGCAGCCA CACTCCAAGC CA AGTCTTCC 1 680 
CTAGAGAACA GCAGCAG ATC CATGGTCTCA TCAGTGTCCT T GTAGC TTTG TGC AGCAGTT 1 740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1 800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1 860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1 920 

10 GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTG ATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGOCTGGG 2220 

1 5 AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

20 CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA A AAAAAAAAA 2640 
AA 

Seq ID NO: 56 Protein sequence: 
25 Protein Accession U: AAC39582. 1 

1 11 21 31 41 51 

1 I 1 I I ! 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEE1HAEVCY AECLLQRAAL TFLQDENM VS 1 20 
F1KGG1KVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVN1EEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATY1YM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAV1GKQ PKLTDGILE1 420 
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRStSA NEKKIKYDHY 480 
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRJQAAT LQAKSSLENS 540 
SRSMVSSVSL 

40 Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession U: NM_006670. 1 

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

,11 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTA ATA AT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTG AGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCAAGGGGAT AAAAAAGTGG A TGC AT A AC A TCAGAGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO: 58 Protein sequence 
Protein Accession #: NP_006661.1 

65 

1 11 21 31 41 51 

I I ; I I 
MPGGCSRGPA AGDGRLR1AR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180 
PWVCDCIIMADMVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SY'VFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 

Seq ID NO: 59 DNA sequence 
Nucleic Acid Accession ft: NM 024022 
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Coding sequence: 1 -1362(underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

5 ATGGGGG AAA ATGATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCGATC CCTTnTCOC 60 
OTGATGATT TGAAAATAAG TCCTGTTGCA CCAGATGCAC ^TOCTGTTOC ^TGCACAG ATC UO 
OTGTCACTGC TGCCATTGAA GTTTTTTCCA ATCATCGTCA TTGGGATCAT TGCATTGATA 1 80 
SSaCTCG CCATTGGTCT GGGCATCCAC TTCGACTGCT CAGGGA ACTA CAGATGTCGC 240 
TCA^A AGTCTATCGA GCTGATAGCTCGATGTGACG f ^CTCGGA^GCAAAGAC JW 
1 0 rrrrAC.GACG AfiTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTGTTC 360 
1 SgAC CATGTGCTCC G ATGACTGGA AGGGTCACTA CGCAAATGTT 420 

rccT^GCCC AACTGGGTTT CCCAAGCTAT GTGAGTTCAG ATAACCTCAG AGTGAGCTCG 480 
CTGG AGGGGC agTOCCGGA GGAGTTTGTG TCCATCG ATC ACCTCTTGCC AG ATCACAAG 540 
C7C ACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG CCACGTGGTT 600 
1 5 ACCTTGCAGT I CAC AGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
a AC aTTTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTG^G^GGG^ G^TCTGTCAT C TGGATCATCA CTGCTGCACA CTGTGTTTAT 780 

SStoATC TCOTCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
™ AT CCCAOTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA GCCAAAGAGG 900 
10 rTrrGT A ATG AC ATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960 
cTc?Sctg?gcctgc7c7a CTCTGAAGAG AACTTCCCCG ATGGAAAAGTGTGCTGGACG 1020 
TCAgSItGGG ^^G AgStGGAGGT GACGCCTCCC CTGTCCTGAA OCACGOGGCC 1080 
^XCCCmG A TTTCC AACAA G ATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 1 40 
SctCC^GC^CTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200 

GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTACA CCCGTGTCAC CTCCTTCCTG 13.0 
GACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAACCTGA 

Seq ID NO: 60 Protein sequence 
30 Protein Accession #: NP„076927 

I 11 21 31 41 51 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGI1ALI 60 
1 S LAL^L^H I ^DCSGK YRCR SSFKCIELI A RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 
^ScSOTWKGIWANV ACAQLGFPSY VSSDNL*VSS LEGQFREEFV SIDHLLPDDK 180 
VTALHHS™ V rIgCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 
L^G™ W^TAAHCVY DLYLPK.SWTI QVGLVSLLDN PAPSHLVEKJ WHSKYKPKR 300 
LG^iXkL AGPLTFNEM1 QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA ^360 
40 VPLJSNK^N HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 
GIGCAEVNKP GVYTRVTSFL DW1HEQMERD LKT 

Seq ID NO: 61 DNA sequence 
Nucleic Acid Accession #: NMJW6475 
45 Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons) 
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I 1 
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aaca'gaactg caacggagag actcaagais ™ c c^™? A A ™£™g^ 

c A T-n-r-rnrTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTC A I 

aISaagaS Z 

A AA ACG ACTG TTHTA T ATG A ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 
T^rCAGTAG TTTTCCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCC ACC 360 
I^ArrCAGC OTATTCTGA CGCCTCAAAACTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 
^^^Cn^C^TTG^ ACC^^G TAATCAGGCT TGGG ACAACT TGGATTCTGA TATCCGTAGA 480 
J^toSag ^"caACCT^AA TGTOAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
SgAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 

"tSacca ttatcctaat 

ATCCATGGG A ACCAGATTGC AACAAATGGT GTTGTCCATG TCAnGACCG ™TCCTTACA 720 

caaattggta cctcaattca agacttcatt gaagcagaag atgacctttc ^atotttaga tw 

7-!rir.rTr.rCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
mG^^ 

rr AC, ACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 96U 

65 

nr ATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 

r^A aOTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
SgCAT ScS C^CAGATGGA GAATACACTI ^CTOOCACC L^TAAT .260 
7fi nTATTTTCTG ATG AT ACTCT C AGCATGGTT CAGCGCCTCC TTAA ATTAAT TCTGCAG AAT 1320 

' ^^CaTa^T^^^^C^A AAAGT^TiGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 

ATCGG AGCK^A ^^o!(KTC AG^iGTCTTCGTA ^ ATCGrTACAG f^^ST!^^^!?-^^^^^-^' ^500 
TGCATGGAGA AAGGG AGTAA GCAAGGGAGA 1S» 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA A AGTTAAAAC -A AGATAAGCC 1^™^ 1560 
nc ■rT-nnCACCC TArTTnAAGCTGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 

75 Ica™^ TGO^^CC A A TG A^GCTTTT AAGGGAATGA ^CT^C A^GC^ ,7« 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
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TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1 800 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA I860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
5 CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 

10 GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

1 5 AATAACCTGA CCTTAGA AAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATC AGC 2640 

ACAAAGAAGC AATCATCAA A TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 
GAAACATGAG GG AAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACACCC1 11 1TCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 

20 CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
TCTCAAACGT TTCAATAAAA CCATTTTTCA G ATATAAAGA GAATTACTTC AAATTGAGTA 3180 

25 ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 62 Protein sequence: 
Protein Accession #: NP_ 006466 

30 1 11 21 31 41 51 

I 1 I I I I 

MrPFLPMFSL LLLUVNPIN ANNHYDKJLA HSRIRGRDQG PNVCALQQIL GTKXKYFSTC 60 
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLG1VGA TTTQRYSDAS 120 
KLREE1EGKG SFTYFAPSNE AWDNLDSD1R RGLESNVNVE LLNALHSHMI NKRMLTKDLK 1 80 

3 5 NGM1IPSMYN NLGLFINHYP NGWTVNCAR 11HGNQIATN GVVHV1DRVL TQIGTSIQDF 240 
1EAEDDLSSF RAAA1TSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 
MKYHILNTLQ CSESIMGGAV FETLEGNTIE 1GCDGDSITV NGIKMVNKKD IVTNNG VIHL 360 
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLLKLILQ NHILXVKVGL NELYNGQ1LE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480 

40 RNGA1HIFRE 1IKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540 
FKGMTSEEKE ILIRDKNALQ NIlLYHLTPG VF1GKGFEPG VTN1LKTTQG SKIFLKEVND 600 
TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660 
FKFJPVTVYT TKIITKVVEP KIK VIEGSLQ PIIKTEGPTL TKVKJEGEPE FRUKFGETI 720 
TEVIHGEPU KKYTKJIDGV PVE1TEKETR EER11TGPEI KYTRISTGGG ETEETLKKLL 780 

45 QEEVTKVTKF 1EGGDGI ILFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession U: NM 020974 

Coding sequence; 81-3080 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I 1 i 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATG GGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 

5 5 CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 1 80 

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTG A ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

6 0 TTG ATGGCTT CATGTTGGCT CATG ACGGTC ATA ATTGTCT TG ATGTGGAC GAGTGCCTGG 480 

AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATG AATAAG G ATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

65 TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

70 TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1 200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

75 GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
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GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACA AGT GTAACCTTTA 1560 
AGCTAAATGA ACGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
5 TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 
TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA I860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
A^AAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 

1 0 GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCOCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GG AGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

1 5 GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGG A A AAA AT A ATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

20 TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACG A ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

25 TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA G ATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGC A CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 

30 GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGG ATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGG AGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 

35 CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTG AG ACCTGGGAGG 3600 
ACTCAGTTTC TCCAC AGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

40 Seg ID NO: 64 Protein sequence: 

Protein Accession U: NP_066025. 1 



1 II 21 31 41 51 

45 MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDJDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHK SEEGLSCMNK 180 
DHGCSH1CKE APRGSVACEC RPGFELAKNQ RDC1LTCNHG NGGCQHSCDD TADGPECSCH 240 
POYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

50 STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN 1VGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSrN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSG1HL 480 
SSDVTT1RTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

5 5 FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDG A RERCILCPNG 660 
TFONEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRC1RC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKKRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRJL1V VPEIFLPIED DCGDYLVMRK TSSSNS\TTY ETCQTYERPI AFTSRSKKLW 900 

60 IQFKSNEGNS ARGFQVPYVT YDEDYQEUE DIVRDGRLYA SENHQEILKD KKLOCALFDV 960 
LAHPQNYFKY TAQESREMFP R5FIRLLRSK VSRFLRPYK 



Seq ID NO: 65 DNA sequence 
Nucleic Acid Accession ft: NM_007210 
65 Coding sequence: t-1 869 (underlined sequences correspond to start and Stop codons) 

i ii 21 3i 41 51 

ATGAGGCTCC TCCGCAGACG CCACATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTT 60 
70 GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA G AGAGGAGGC CACAGAGAAG 120 
CCGTGGCTGA AGTCCCTGGT GAGCCGGAAG GATCACGTCC TGGACCTCAT GCTGGAGGCC 180 
ATG AACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAGA AGCCCAGCAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
75 CAGAAGAGCA AGTGGACCCC CCTGG AG A CC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAGA GGTCCCTGGG GCCAGACACC 480 
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA CG A AG CCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600 
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACC TAAAGGAGAA GCTGG AGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
5 CAGGCGGAGG TGCTCACGTT CCTGG ATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGA AACAC TTCCTCCACA TGAGAAGCAG 1 020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 

1 0 ATCCCCAAGT CCTACTTTGA GCAC ATCGGT ACCTATGATA ATCAG ATGG A G ATCTGGGG A 1 1 40 
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTGTG GGGGCCAGCT GGAGATCATC 1 200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCC AAGGGC 1 260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAGAA ATCCTTCGGT 1380 

1 5 GACATTTCGG AACGACTGCA GCTG AGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1 440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTGA CGCCCACCTT CTATGGTGCC 1500 
ATCAAGAACC TCGGCACCAA CCAATGCCTG G ATGTGGGTG AGAACAACCG CGGGGGGAAG 1 560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1 620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1 680 

20 CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC A ATCCCAGTG ACCCCCATCA GTTGTGGCTC 1 860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980 

2 5 AACCCACTGG GTGGCAAGGC AGG ACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 2040 

CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2 100 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG GAGAATGGCT TGAGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 

3 0 A ACATAGTGA GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 

ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTGAAGCTAC AGTGAGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AGAAAAGTAC TTCTAC AG CC 2520 
ATGTCCTATT CCTTG ATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580 
3 5 CTGGGCACAG TGGCTCACAC CTGTA ATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 
A 

Seq ID NO: 66 Protein sequence: 
Protein Accession # NPJX»I41 

45 1 II 21 31 41 51 

III II 

MRLLRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60 
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAF 120 
QKSKWTPLET QEKEEGYKKH CFN AF ASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 1 80 

50 VIIVFHNEAW STLLRTVYSV LHTTPAILLK EIILVDDAST EEHLKEKLEQ YVKQLQWRV 240 
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHG WLE PLLAR1AEDK TVVVSPDIVT 300 
IDLNTFEFAK PVORGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYP1K SPTFAGGLFS 360 
IPKSYTEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEII PCS VVGHVFR TKSPHTFPKG 420 
TSVIARNQVR LAEVWMDSYK KIFYRRNLQA AKMAQEKSFG D1SERLQLRE QLHCHNFSWY 480 

55 LHNVYPEMFV PDLTPTFYGA IKNLGTNQCL DVGENNRGGK PLIMYSCHGL GGNQYFEYTT 540 
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG K.NSQVPKDEE WELAQDQUR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 

Seq ID NO: 67 DNA sequence 
60 Nucleic Acid Accession #: NMJH4112 

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons) 

1 It 21 31 41 51 

cc I E I I I I 

65 TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT GACGTAATGC 60 

GCGGAGACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAGACCCGAT CTTACTGCAA 120 
TCTCTAGCGT CC TCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 1 80 
CTCTCTCGCC TTCCTATTTC TTTTTTTTTT TTTT AA AC A A AAAACAACAC CCCCTCCCCT 240 
CTCCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300 

70 TCTCTTTTTT AATAAGGGTG GGGG AGGGAA AGGGGGGGGA GGCAGGAAAG ACCTTTTTCT 360 
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCA AACAAC AGAAGACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTGACGATTA ATCAACAGAT 480 
GTACAGATCA GCTCTCAAAA TGTCTTCTGT GTCTTCTGAG CGTCTTCTAA GACAATTGCA 540 
TTAGCCTCCT GCTAGTTGAC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 

75 TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAGAAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAG AATTCT CTGCAGATCA GATGTCAGAA AATACGGATC 780 
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AGAGTGATGC TGCAGAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT S40 
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTG AGCCC TTG AAGTCTC 1 020 
5 CGCAAAG AGC AGAGGCAGAT G ACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTC AC 1 080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 11 40 
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
AAGTGCCTTC AGATGGGGGT GTAAGACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 

10 GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGCCCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1 500 
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATG 1560 
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620 

1 5 GGAACACCAA GTATTTCCGC TGTAA ATTCT GCAATTTCAC TTATATGGGC A ACTCATCCA 1 680 
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTG AGGT TGCAAAACCT TCAG AGAAAA ACTCTAACAA GTCCATCCCT GC A CTTCA AT 1 800 
CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1860 
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAGA CAAAATGGTA 1920 

20 CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TCATCTAGCT 1 980 
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 

25 GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGGACT TTGCAGCCCA GAAAAGCACC TTGGAGAAAT TACTTATCCG TTTGCTTGTA 2400 
GAAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAGATG 2520 

30 TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTACCAAATG TGATTTTATT ACCCAAGTGG AAGAAGAGAT TTCCCGACAC TACAGGAGAG 2700 
CACACAGCTG CTACAA ATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGGACAT CACTACAGCC AACGGCGAAG 2820 

3 5 AGGACGGTCA TGCCATATCC ACCATCAAAG AGG AGCCCAA AATTGACTTC AGGGTCTAC A 2 880 
ATCTGCTAAC TCCAGACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTG GTGAAGAGAG 2940 
AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG GACCGAG AGT TCCAGTGATG 3000 
ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060 
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3 120 

40 GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTG G CTGTG G 3180 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GA AAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTGCCTGAC CACAAAGACC TCTCTCTGGC 3360 
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420 

45 CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
G AACAAG AAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAGAGA GAAATTCCAC TCCCCAGCCT AAGTAAATAC G A AGCCC AGG 3660 
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 

50 TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGGAG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTGAAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TA ATGACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020 

5 5 TGAGTCACGT GCCTGGCCTA CCAAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080 
ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260 
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320 

60 TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAGAGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTTATGAA 4560 
GACCTATTAA AAAAATACTT CATAGAGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 

65 GTTATTCTTT CTTTTGATGA GTAGGTTACC AAGATTAAAA AGTGAGATAA ATGGTCAATG 4680 
AGA AAGAATG GAAGATGGTA AAC AATCACT TTTTAAA ACC TGTTAAGTCA AAACCATCTT 4740 
GGCTAATATG I ACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 480C 
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACT GGACTGGCCA TTG G ACC ATT TGTTCCAAAA COCATAAATT GTTGCCTAAA 4920 
70 TTTATAATGA TCATGAAACC CTAGGCAGAG GAGGAGAAAT TGAAGGTCCA GGGCAATGAA 4980 
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTGACC 5040 
TAGAAATGCG AG CTGTG GTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100 
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTGATATTTT 5 1 60 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220 
75 CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTGTAGTG ATTTCATTTT 5280 
AGCCAGGTAT TTCTTTCTTG TGTGTG ATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAA AAGC 5400 
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ATAGATGGAT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCATTTCTT 5460 
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTG A CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
5 GTATGTCCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT l'I 1 TIGCCAC GGAGATGTAG 5880 
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 

1 0 AGAACAAATA GGAAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000 
ATAATGGTGC CTGACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120 
CAA ACAAGGT AATG AAGCA A CTATTTCTAT TGCAGTAG AT ATCCTTTTGT GTGTGTGTGT 61 80 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 

1 5 GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATGATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420 
CAGAAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTGCATTCTG 6480 
TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 

20 TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600 
AGAGTTCAAC CAGTTCGTGA CTGTTGTAGC GAATGAAGTT AAAAAAATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAA A AAA GAGAAAAGAG 6720 
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 

25 AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 

GGATGCATCT CAAAATGTAT AGCCAGACTT GAGAGGTG AC AATTAAAGAT CTAAAAAAGA 6960 
GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 
TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAGA 7080 
AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTCATTAAG 7140 

3 0 CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
ATTTTCATTC AATTTTTTTT TTTTCTCAGA CAG ACATAGT AGTATCAACT AGCATTGGAA 7260 
AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 
CATTTTTTGC TTTCATTATT ATACATATTT TGGTGG AG AA GAGGTTGGGC TTTTTTGAAA 7440 

3 5 GAGACAAAAA TTTATTATAA CACTAAACAC TCCTTTTTTG ACATATTAAA GCCTTTATTC 7500 
CATCTCTCAA GATATATTAT AAAATTTATT TTTTTAATTT AAGATTTCTG AATTATTTTA 7560 
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGGA A CTTTTTTTAA TGAGG AATTT 7620 
CATGATGATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTGATGA AAATGATGTG 7680 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740 

40 CTAAATTGAG CTTTGGATCA AACTTTAGGC CAGGACCAGC TCATGCGTTC TCATTCTTCC 7800 
TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAGAAA 7860 
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAGAGACAC TACATGACTC 7920 
CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTCTCTAC ATGACATATT GAGATTTTTT TTAATCAACT TTTAAGATAG 8040 

45 TGATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTT ATAA AACAAGCATG 8100 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATACATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 

50 CTCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAGACCT GTGAACTGCT 8400 
GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CA TCCAG TGG 8460 
TGGAAATGTG GAGTAAAGAA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTAAGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 8580 
AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTGAC 8640 

55 TGTCCAAGGC ACAAGAGAAA ATTGAGGAAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700 
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAGGCAACT GCAAATGAGA 8760 
AG AAGATAGA AGGACGCCCG TCGGGACTTT GGAGGGCATT GTTATTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880 
TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 

60 AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAGAAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 
GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTTATCAGA GAAAATTGTC 9180 
CTTCTAGAGC ATAGAGTCCC AAA ACTCAAT TCTGGTTTTC CCCTGTTTTT TTTTTTTTTT 9240 

65 TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 

CCTGTAAAAT TGAAAAATAT ATGTATTAAT A ATATTATI A ATAATAATAA TGGTAATGTA 9360 
GTACTTGTTT GTAAAGCACT TTGAGATCCT TGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480 
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 

70 TGAAGATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600 
GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGG A ATGATTTCAT GTTGCTAAGA 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840 

7 5 TCTTACAAC A CATGAAATTT TTTCGTTTTA TTTT ATTTTG TTTTCATAGT GCATGTTCAT 9900 

TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A 
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Seq ID NO: 68 Protein sequence; 
Protein Accession #: NPJ)5483I 

5 I 11 21 31 41 51 

I i I I I 

MPYEVNAGYD FTNMVRKKNP PLRNVASEGE GQILEPlGTE SKVSGKNKEF SADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPltDEVTD 120 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 1 80 

1 0 QSGQANCQGL SPVSV ASKNP QVPSDGG VRL NKSKTDLLVN DNPDPAPLSP ELQDFK.CNIC 240 
GYG YYGNDPT DLIKHFRKYH LGLHNRTRQD AELDSK1LAL HNM VQFSHSK DFQKVNRSVF 300 
SGVLQDINSS RPVLLNGTYD VQVTSGGTF] GIGRKTPDCQ GNTKYFRCKJ CNFTYMGNSS 360 
TELEQHFLQT HPNK1KASLP SSEVAKPSEK NSNKS1PALQ SSDSGDLGKW QDKITVKAGD 420 
DTPVGYSVPt KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLK.LLEHYGK QHGAVQSGGL 480 

1 5 NPELNDKLSR GSVINQNDLA K.SSEGETMTK TDKSSSG AKK KDFSSKGAED NM VTSYNCQF 540 
CDFRYSKSHG PDVIVVGPLL RHYQQLHN1H KCT1KHCPFC PRGLCSPEKH LGEITYPFAC 600 
RKSNCSHCAL LLLHLSPGAA GSSRVKJ-IQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSDGQQSVKESKEHS CTKCDFITQV EEE1SRHYRR AHSCYKCRQC SFTAADTQSL 720 
LEHFNTVHCQ EQDITTAMGE EDGHAISTIK EEPKIDFRVY NLLTPDSKMG EPVSESVVKR 780 

20 EKLEEKDGLK EKVWTESSSD DLRN VTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPrYGLAV ETKGFLQGAP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLNIIK QNNGEQIIRR 960 
RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1020 
GSLTtCSHSAQ QPVLVSQTLD IHKRMQPLHI QIKSPQESTG DPGNSSSVSE GKGSSERGSP 1080 

25 IEKYMRPAKHPNYSPPGSPIEKYQYPLFGLPFVHNDFQSEADWLRFWSKYKLSVPGNPHY 1140 
LSHVPGLPNPCQNYVPYPTF N1PPHFSAVG SDNDIPLDLA IKHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPL NVVKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSGPFQCSI 1260 
CQHLCTDKYD FTTH1QRGLH RNNAQVEKNG KPKE 

30 Seq ID NO: 69 DNA sequence 

Nucleic Acid Accession #: XM_073S79 

Coding sequence: 1-3 87( underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

35 i i i | t l 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60 
GAAGTGAAAA TGTTTGG AGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 1 20 
GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAGACACTG TTCTCTTCGA GAGTTTATAC 1 80 
CACTGCGGCT TTGAACACGG GAGCGTG ATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 
40 GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCGACTCCT 300 
GTCTGTCCTG CACACCACTC AACACAGAGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAACTAA 

Seq ID NO: 70 Protein sequence: 
45 Protein Accession H: XP^_073379 

1 11 21 31 41 51 
1 I I I I I 

MGFGDQGTVE GSLGTSKKPP EVKMFG ASQG LLTM ETNQSL AQGTGCSVVK VDTVLFESLY 60 
50 HCGFEHGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Seq ID NO: 71 DNA sequence 
Nucleic Acid Accession #: ,\B033064 
5 5 Coding sequence: 826- 1 98 6 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

11:111 

GGGGAACAGC AAATTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 

60 CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTG AG CGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCCATGT 180 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
G AGTGACAAT AAATTAGCCC CTTCTTTTTT CCCTTGTCAT TCAGGCCCCT TTTCCTCTCC 300 
AGAGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 

65 GAAACACACT AAAGTGGACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTGG CACATGGAGA AACATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 480 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540 
CCCTCTCTAT GAGCTG A A AG ACTG AC C A AC CATG A AC ACT AGTAGGGGAT GGGG AAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660 

70 TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTG AAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA GAGAGAG AAC CAAACATAAA GTAAGGAGGG TAG A CGTT AC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGATQTG TAGTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT G ACAG ATCGT CGCATCAGGG AAGACTATCC TCAAAAAG AG 900 
ATTTTACGAG CATTG AAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 

75 ATGGATGAGG TGGTGCTGAC AATCGAGCAA GGAAACCTGG GTCTGCGGAT CAATGGAGAG 1020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1030 
GATAGTGACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1 140 
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CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTr GGCTGGCCAT 1200 
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAG A ATACGCGGGG TCACAG AGGT 1320 
A AAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1 330 
5 C ATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAGAGT CTCATGG ACG GGATGTACGT 1440 
GTCATTGTCG TGGG AGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1 500 
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
A AGCAGCTAG CI ATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG A'IGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 

1 0 GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740 
CTrCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1 800 
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 

1 5 GACTGACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040 

ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATG AGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGGAAC 2280 

20 AAATACTTTT TT1TTTTCAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340 

GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGTC ATCAGTGAGG 2400 
AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520 
GGATTTCTTA TTnTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 

25 CC CA ATA A AG GC TTCTTAAT GACAAAATTG GCATGTTTGC ATGATGAAAT GGAAATGAAC 2640 
AG TATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TG TAG AT AAA ATTACACTAG 2700 
TTTAAAATAT G TGCATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760 
TCTGTTAATG TrGCTTTTTT rriTTTTTrT TAATACATGC TAGTCTAACA TTTCCTGCTC 2820 
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTACCTTTTT TGTTAACAAG 2880 

30 ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 

TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000 
TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTG C ATTTTTCACA 3060 
TTTTACGAGG G AGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120 
TTGTTTCCAT C A ACT A AT C A AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 3180 

35 GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGACTCCACT GGGGTAGAGG 3240 
TATTCACCTT AAAACATAGG GTGAGTAGAT GC'l 1 1 1 1 1 AG GCCTTTTTGT GTATATGTAC 3300 
GTTGTTTGTT TTTTTCCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGGGAT 3420 
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480 

40 CC CTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540 

AA AAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGAAGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 

45 AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTG ATCA 3840 
TGCTTACTTG TACAGTTTTI CCCCCGTTTT AAAAAGGAAT GTAATAAAAT TTGTTTTTTC 3900 
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020 
TA ATTTATCT TAGGATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 

50 GGGGACTCAC AT ACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATGACCAT ACTGTGGAGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA GAACTGCTTA ATTAAATTAT CATTCATATG TTCATATAGA GACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTrT G AATAGTTAC ATATCACAAG TATGTAGTTC 4320 
A ] GTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 

55 AC CAAATTAA CCA ACT AT AT TATAGGAAAT ATGTGAAATT AGTTCATTAG CTTTATTCAC 4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAGAT TGAGGTAGAG 4560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 

60 AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TOG TAT ATT 4740 
GTTAAAATTG TAATTCTAAA TTGTATTTCA AAAATGATTA TTTCrGATAT TGTITITATG 4800 
TCACCCATGA T GAAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CrATrCTATr 4860 
TAAAATTITI' AA TAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980 

65 GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT GAAGGCATIG ACTTTGAAAA TCATCTCTTT TTCTCAAGAA GAAAG CAATG 5 100 
GAtiAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GGCTTGTGGG GCTTAGAGAC 5160 
A'l 1 GTGAAAT CAA ATCPI GT G iH ATACTTT TCTCCTGGCT CACTTTTTTT GAGAAGGTTT 5220 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 

70 GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTrC AATTTTATTT A AG A PrrGTT 5340 
rrrGTTGTAC TAGGATTTTA AAAAATGTAA TATATl'GCAG GATTTATAAC CAG 

Seq 1 D NO: 72 Protein sequence: 
Protein Accession H: BAA86552 
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MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAVVMDEVV LTIEQGNLGL 60 
RINGELITAY PQVVVVRVPT PWVQSDSD1T VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120 
ELAGHG VPLP DTFSYGGHEN FAKMIDEAEV LEFPMVVK.NT RGHRGKAVFL ARDKHHLADL 180 
SHLIRHEAPY LFQKYVKESH GRDVRVIWG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240 
5 LSEQGKQLA1 QVSN1LGMDV CG1DLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAGIIA 300 
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTK1PGGL FNMNQLLANE IKLLVD 

Seq ID NO: 73 DNA sequence 
1 0 Nucleic Acid Accession ft: XM_040080.2 

Coding sequence: 159-1 104 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 5 CTGAGTGGGG GCGGGGACTG CTGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 

CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGG C 120 
GTGAGGGAGT GACAGCAGCG CATTCGCGGG ACGAGAGCG A TG AGTGAGAA CGCCGCACCA 1 80 
GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCTCCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 
20 TTCGACAGAC TCATCCCTCT TCTCCCGCAA G ACTTTTATT ACGTTGCCAT GG ATTTCG GA 360 
GGTCATGGGC TCTCGTCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GACTTTTGTG 420 
AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGG AATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAGAT GGTGGATAAA 540 
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGAATCAG ATGAAATGG A GAACTrGCTG 600 

2 5 ACCTACA AGC GGAG AGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 

CACGTGTTCA GCCTGAAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 
GAGGAGTGCG GGGAGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 

3 0 GGATATTTTG ATTCAAGACA GAATTACTCT GAG AAGGAGT CCCTGTCGTT CATGATAGAC 960 

ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
GTCCACATGA GCG AACCCC A G CACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCTGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1 140 
GACTCAACAC TGGG ACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGGAC 1200 
35 AGGCCTCACT AGTCTTGAGG CCCAGCCTAG GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260 
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

40 Scq ID NO: 74 Protein sequence: 

Protein Accession #: XP_040080.1 

1 11 21 31 41 51 

I I I I I I 

45 MSENAAPGLI SELKLAVPWG H1AAKAWGSLQGPPVLCLHG WLDNASSFDR LIPLLPQDFY 60 
YVAMDFGGHG LSSHYSPGVp YYLQTFVSEI RRVVAALKWN RFSILGHSFG GVVGGMFFCT 120 
FPEMVDKML LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQEPSHVF SUCQLLQRLL 180 
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDF1SRELCA HSIRKLQAHV 240 
LLIKAVHGYF DSRQNYSEKE SLSFMIDTMK STLKEQFQFV EVPGNHCVHM SEPQHVASII 300 

50 SSFLQCTHML PAQL 

Seq ID NO: 75 DNA sequence 
Nucleic Acid Accession #: NM_005794 
^ ^ Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I 1 I I I I 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 1 20 

60 CTTCACTCCT GGGGCCAGCA AG ACCACG AA TGCACCGAGA GGA ATGAACA ACTCTGG ACA 1 80 
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTG AAGTCA 240 
GTG AGGCCAA G AACCCATCA ATTCCGTACA CATTTTGGTG ACTTTG AAGA GACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAGACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 

65 GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGG AC GGGGCCCACG TGGTC ATCAG CAGCCGGA AG CAG CAGAACG TGGACCGGG C 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 

70 GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTG AACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1 020 

75 TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1 1 40 
CAAGGAACAT CATCAGCTGC AGAGG ATTGG GGAGTCAG AG GACTGTGCAG GAATCGTGTC 1 200 
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CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACrCGG CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTrGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 
AGCAATTTGG GGGC1TACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCA'ITCT 1440 
5 CC 

Seq ID NO: 76 Protein sequence : 
Protein Accession ft: NP005785 

10 1 11 21 31 41 51 

11 III 

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHW1SSRKQ QNVDRAMAKL QGEGLSVAGI VCIIVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AG VNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPVV 1 80 
15 ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCS PDASY VNGEN IAVAGYSTRL 



Seq ID NO: 77 DNA sequence 
20 Nucleic Acid Accession U: NM_002425 

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
oc I I III 

25 AAAGAAGGTA AGGGCAGTGA GAATQATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGG ACTCCA ACAAGGATCT 120 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 1 80 
AAAGGACAGT AATCTCATTG TTAAAA AAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 
3 0 TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 
TACATACAGG ATTGTG AATT ATACACCAG A TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTG AA AGTCT GGGAAG AGGT G ACTCCACTC ACATTCTCC A GGCTGTATGA 480 
AGGAGAGGCT GATATAATGA TC rCTITCGC AG TTAAAG AA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTAOCCACCT GGACCTGGGC TTTATGGAGA 600 
3 5 TATTCACT1T GATG ATGATG AAA AATGGAC AG AAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTG AAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 
TGATGTGAAT GGCATTCAGT CTCTC TACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTGCCCACA AAATCTGTTC CTI CGGG ATC TGAG ATGCCA GCCAAGTGTG ATCCTGCTTT 900 
40 GTCCTTCG AT GCCATCAGCA CTC TGAGGGG AG AATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA A ATG AGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCA AG 1 140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
45 CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT G ATGACTTTC CAGGAGTTGA 1320 
GCCTAAGGIT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1 380 
ACAGTTTG AG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
GTTACATTGC TAGGCGAGAT AG G GGGAAG A CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 
5 0 ATTATTCATC TAATGTATTA TGAG CCA AAA TGGTTAATTT TTCCTGCATG TTCTGTG ACT 1 560 
G A AG A AG ATG AGCCTTGCAG AT ATCTGCAT GTGTCATG AA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT G AACAGA ATT A AG AAAT ACT CATGTGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAG ATGTGT TAT rACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1 740 
CTT 
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Seq ID NO: 78 Protein sequence: 
Protein Accession #: NP 0024 1 6 



1 11 21 31 41 5] 

60 | [ | | | ) 

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 
KJQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120 
PDLPRDAVDS AIEKALKVWE EVTPLTI r SRl YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 

65 FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFI ILISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTH1LKSN SWLHC 

70 Seq ID NO. 79 DNA sequence 

Nucleic Acid Accession U: MM _000493.1 

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
75 | | | I I I 

ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTA TCCTTG A ACTI'GGTTCA TGGAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
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CAGTTCTTCA TTCCCTACAC CATAAAG AGT AAAGGTATAG CAGTAAGAGG AGAGCAAGGT ISO 
ACTCCTGGTC CACCAGGCCC TGCTGGACCT CGAGGGCACC CAGGTCCTTC TGGACCACCA 240 
GGAAAACCAG GCTACGGAAG TCCTGGACTC CAAGGAGAGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
5 GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TGAATGGACA GAAAGGGGAA ATGGGATATG GTGCTCCTGG TCGTCCAGGT 600 
GAGAGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGA GTGG GA 660 

1 0 AAAAGAGGTG AAAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGGCATTG GAAAGCCAGG AGCTGCTGG A GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GG ACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 

1 5 GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 
GGGAATATGG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1140 
TCCCCTGGGT CAGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TCCTGGTCTC 1260 

20 CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTG GGTCTAAAGG GGATCCAGG A AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 

25 ATGCCTGAGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1620 
AGTGCCAACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA TTTTGTATAA CAG GCAAC AG 1740 
CATTATGACC CAAGGACTGG AATCTTTACT TGTCAGATAC CAGGAATATA CTATTTTTCA 1800 
TACCACGTGC ATGTG AAAGG GACTCATGTT TGCGTAGGCC TGTATAAG AA TGGCACCCXrr 1 860 

3 0 GTAATGTACA CCTATGATG A ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG G ATTCCTAGT GGCTCCAATG 2040 
TGAGTACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGC ATTC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATGTAATTTT TATTTTCTGA 2160 

3 5 A ATA C AG ATT TGAG CTATCA GACCAACAAA CCTTCCCCCT G AAAAGTGAG CAGCAACGTA 2220 
AAAACGTATG TGAAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAGA AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAGA AATAGAGCTC TAAGTTATGT 2400 
GAAATTTGAT TTGAGAAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 

40 TATGAGAACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAAAGCTC TT A TATA AAA AAGCCCCAAA ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
TTTCATCAAT GAACCTTTTC AAAACTTTTC TATGATTGCA G AGAAGCTTT TTATATACCC 2760 

45 AGCATAACTT GGAAACAGGT ATCTGACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTG ATATG ATTTTCTGGA TTTACAGAAC 28S0 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CTGAGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TATT TTTATA 3000 
TTGCTGTTTA AAACTITTAA GCTGTGCCTC AC TT ATT AAA GCACAAAATG TTTTACCTAC 3060 

5 0 TCCTTATTTA CG ACACAATA AAATAACATC AATAGATTTT TAGGCTGAAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCTCA ACCATTCTTT CAAGGCTTTT CATTCGACAC AATAAAATAA 3180 
CATCAATAGA TTTTTAGG 

Seq ID NO: 80 Protein sequence: 
55 Protein Accession #: NP_OO0484.1 

1 II 21 31 41 51 

1 I I I I I 

60 MLPQIPFLLL VSLNLVHGVF YAERYQTPTG DCGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 
VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRG ENG VPGQ PGIKGDRGFP 240 
GEMGP1GPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

65 PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
K.G ETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGfYYFS 600 

70 YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence 
Nuc leic Acid Accession U: NM_000786 
75 Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
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CGCGATTCTC AGGGATTGAT CCGCCTCTTC AGGTAAGTTA TCITCCGGCC CCGTACCACT 60 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 120 
ACAGAGTGTT ATITAAGGGC GTGGCCAGCG GAACATCCCG CCCCAT TCI G TGACGCACGG 1 80 
5 GGTGGOGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CG ACCATCTG CCAGCTTCTC TCGTTCCGTC GATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATG.GCGGCG GCGGCTGGGA TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCITCACC CTCAGCCTGG TCTACCTGAT 480 

1 0 CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540 
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600 
TCTAGAAAAT GCATATGAGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAG AC 660 
ATTTACTTAC CTTCTGGGGA GTG ATGCTGC TGCACTGCTT TTTAATAGTA AAAATGAAG A 720 
CCTGAATGCA GAAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGGA AGGGAGTTGC 780 

1 5 ATACGATGTG CCTAATCCAG TTTTCTTGGA GC AG A AG AAA ATGTTAAAAA GTGGCCTTAA 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTG A 900 
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGG AAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1080 

20 GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGGATATTTT 1 140 
CTATAAGGCA ATCCAG AAAC GCAG ACAGTC TCAAGAAAAA ATTGATGACA TTCTCCA A AC 1 200 
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATG ATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380 

25 CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1 440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560 
TCCCACTGTC AATCAAAG AC TTAAAGACTC ATGGGTAGAA CGCCTGG ACT TTAATCCTG A 1 620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 

30 TGCGCGTCAT CGTTGTATTG GGGAAAATTr TGCCTATGTT CAAATTAAGA CAATTTGGTC 1740 
CACTATGCTT CGTTTATATG AATTTG ATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1 800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACGAA GATCAAAATG 1 860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAG A ATGAAG TGTACAAAAC AACTCTTGTA GTTTACTG'IT TTnTAAGTG TGTAATTCTA 1980 

3 5 AAAGCCAGTT TATGATTTAG G ATTTTGTTA ACTGAATGGT TCTATCAAAT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2 100 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAGAGTACTC ATCGCTCTTG CCA A ATAAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGG A A ACCACAGTTT 2280 

40 ATTATTAriT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATCG A 2340 
ACATCTAAAG CCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGG AA 2460 
ATTAAGAGAT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 

45 TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGGA AGTCCAAAGT CAAGTCCTTA 2640 
GGGGATAATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC TTTTTTTTTT TTTTTTTTTT 2700 
TTGAGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCCTCCCAA GTAGTTGGG A 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTATTTTTAGTAGA GACGGGGGTT 2S80 

5 0 TCACTATGTT GGCTAGGCTG GTCTTG AACT CCTGAGCCTC GTG AGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGG ATT A CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
TTTTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACA TTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3180 

5 5 TCTCAAATTG TAACTCGGGG CCTAACTATA TG AGATGGCT GAAAAAATAC CACATCGTCT 3240 
GTTTTCACTA GGTGATGCCA AAATATITTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAA ITGT GAAGGCAG AA TCl'CTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTGAAAAAA T 

60 Seg ID NO: 82 Protein sequence: 

Protein Accession #: NP_000777 

1 11 21 31 41 51 
S C I I I I I I 

05 MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLL1AC AFTLSLVYLI RLAAGHLVQL 60 
PAGVKSPPYI FSPIPFLGHA IAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 120 
ALLFNSK.NED LNAEDVYSRL TTPVFGKG VA YDVPNPVFLE QKKMLKSGLN IAHFKQHVSI 180 
IEKFIKEYFE SWGESGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREJKDIF YKAIQKRRQS QEK.IDDILQT LLDATYKDGR 300 

70 PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCIKETLRLR PPIMIMMRWA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYA'PFGA GR1 IRCIGFJNF AYVQIKTIWS TMLRLYEFDL 480 
IDGYFPTVNY TTMIHTPENP V1RYKRRSK 

75 Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession #: NM_()06551.2 

Coding sequence: 64-336 (underlined sequences correspond to start and stop codons) 
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'aaTTCTAG AA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60 
5 \CCATGAAGC TGTCGGTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACCAGGCC 120 
AATGCCGAGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA ISO 
CCTCTGTTCA AGTTA AGTCT TGCCAAATTT GATGCCCCTC CGG AAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300 
GTCCTGGTG A AAATATTGAA GAAATGTAGT GTGTGACATG TAAAAACITT CATCCTGGTT 360 
1 0 TCCACTGTCT TTCAATGACA CCCTGATCTT CACTGCAGAA TGTAA AGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seq ID NO; 84 Protein sequence: 
Protein Accession NP_006542. 1 

15 

1 11 21 31 41 51 

MKLSVCLliv TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSLIAEV LVK1LKKCSV 

20 

Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession U: NM JXJ23 1 7. 1 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons) 
25 1 11 21 31 41 51 

GGGCCAGGAC TGAG AAAGGG GAAAGGGAAG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TGAATCCCAC CCTTGGCATT GCTTGGTGGA GACTGAGATA CCCGTGCTCC 120 
GCTCGCCTCC TTGGTTG AAG ATTTCTCCTT CCCTCACGTG ATTTGAGCCC CGTTTTTATT 1 30 

30 TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATGCGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCGCCGG CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC GCCTGGCGCC 360 
AGCAGATCCA ATGGGAG AAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 

35 CCCAGCAGCC CCGCACTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GG AGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCGAAGC TGGGCCCTCG CGCGCGGAGA 660 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACGAC CCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 

40 ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAG ATG TCAG AG ATT A TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 

45 ATTCCTGGG A ATGGCACAGT TGTCATCAAC ATTACCACAG TATGGATG AG TTTAGCCACT 1 140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATGATA CCTATGGTGC AGACATAGAC TGCCAGTGGA 1320 
TTG ATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 

50 ACCTGGTTCC TGAATCTGAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TTAGAAGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAGACT AACTTCAGTA 1560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1 680 

5 5 TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCA A TTCTGTAATT 1 740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 86 Protein sequence: 
Protein Accession U: NPJJ02 308.1 

60 

1 11 21 31 41 51 

MRFAWTVLLL GPLQLC ALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSOYQPQRRR DPGAAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 120 
65 ARHWFQAGYS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 180 
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EENCLASTAY RADVRDYDHK VLLRFPQRVX NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 30C 
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360 
CQWIDITDVK PGNY1LKVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #: NM_006419.1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons) 
75 1 11 21 31 41 51 

TTCGGCACTT GGG AGAAG AT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGG ACTCAG A 60 
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GCICAAGTCT GAACTCTACC TCCAGACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 1 20 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 180 
rrGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGG G AATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAG 300 
5 AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
VI GAGAAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAGAAA GATTCCCTGA 420 
TGCTGATATT TCCACTAAGA ACACCTGCAT TCTTCCCITA TOCCTGCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGGAG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTTCACTCA 600 

1 0 CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
AAAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTITAAAAT TTCTTAGAAA ACAATGGAAT GAGAATTTAA GCCTCAAATT TGAACATGTG 780 
GCTI GAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GGGAGGCAG A TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 

1 5 TAGATATCCT CTGCTTA AAA ACTCACTACG G AGG AG AATT AAGTCCTACT TTTAAAG AAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080 
GACAAACTTC TTTCACTCAC ATCTTTTTCA CTGACTTTTT TTGTGGGGGC GGGGCCGGGG 1 1 40 
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 

20 AGCAAACATT TTACTT 

Seq ID NO: 88 Protein sequence: 
Protein Accession #: NP_0064 1 0. 1 

25 1 11 21 31 41 51 

I 1 ! I 1 I 

MKHSTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRTIDR 1QILPRGNGC 60 
PRKEIIVWKK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

30 Seq ID NO 89 DNA sequence 

Nucleic Acid Accession #: NM_0O2652 

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

35 till I I 

CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 1 20 
CAGGACAACA CICGGAAGAT CATAATAAAG AATTTTG A CA TTCCCAAGTC AGTACGTCCA 180 
AATGACGAAG TCACTGCAGT GCTTGCAGTT CAAACAG AAT TGAAAGAATG CATGGTGGTT 240 

40 AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTG ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAG AACTGTG 360 
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAA A ACAA CCGGTTTTAT ACTATTG AA A TCCTAAAGGT AGAATAATGG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGG AATTTC 540 

45 TGCI GTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 

Seq Ih NO: 90 Protein sequence: 
Protein Accession #: NP_002643. 1 

50 1 11 21 31 41 51 
I I i ! I I 

MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIIKNF DIPKSVRPND EVTAVLA VQT 60 
ELKECMVVKT YU SSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAVVDVIREL 120 
GICPDDAAVI PIKNNRFYTI EILKV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession #: AKO0034 1 

Coding sequence: 35-975 (underlined sequences correspond to start and stop codons) 

60 1 II 21 31 41 51 
I I I I I I 

GATAGCGCCG GGCAGAGGGA CCCGGCTACC CTGGACAGCG CATCGCCGCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG GATC ATG GAA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCTI nTTGG ACAATATGTT TGGACCGCG A GATTCTCGAG TCAGAGGGTG GTTCACGTTG 180 

65 GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 

AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CITGGAATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTGA CATCCGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAA1TITC 480 

70 TTCGTTTTGC GGAAAAAAAC GAG 1 CAG ATT ACITTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCrGGTGGTG TGTCTTG A AC TGG ATACCTT GTGGACAAAG TTTCTTTGGA 600 
CC AACACTG A ACAGT'ITTGT CCACATTCTT ATGTACTCCT ACTATGGACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTGG CTTCCCCTTC 780 

75 GGTIGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 

TATGTTCAGA CATACCGAAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTCJA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTGATGAAC 960 

386 



WO 02/(159377 



AAGAAAGCAC AATAAAAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTGGC 1020 
TTGTTTTAAA GCAAAGACTG AATTGAAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1 140 
AACTGAATGC TTTG ATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
CAGCTTCTCT CTCCCTCCCT CCCACAGACT GA^CCTTTCG CCAGAAGCTG TCCTTATAAC 1260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA A TG IT A ATT A TATTCTAAAT 13S0 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAA AT ATT A A AATGGGATCA TACACAG AAG ATAGAA AAAT 1 500 
CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAGATTCCC ACAACCACTC 1560 
CTGATTCTAA ATTTAGTG AT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1 620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG G ATAGGATTG 1 680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG 1 740 
GTTATACAGA TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 
TTCTGAGCAA GCACTGGAGT CCACTCCGAC CTTTTTCTTT G AACATGCAT GCTGCTGGAA 1860 
TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTGA GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG G AG G A AAA AG ATGGAGTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 
CAGCACTCCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAGT 2160 
AG A CCA A AG T ATTAATGAGT ATTTCCTTTC TCCATAAGTG CAGG ACTGTT ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 92 Protein sequence: 
Protein Accession #: BAA91096.1 

1 11 21 31 41 51 

MEHLKAFDDE INAFLDNMFG PRDSRVRGWF TLDSYLFTFF LTVMYLLSrW LG NK YMKNRP 60 
ALSLRG1LTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEAD1 RVAKVLWWYY 1 20 
FSKSVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180 
ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTITHT MSAWKPCGF PFGCLIFQSS 240 
YMLTLVILFL NFYVQTY RKK PMKKDMQEPP AGKEVKNGFS KAYFTAAMGV MNKKAQ 

Seq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM_000044 

Coding sequence: 1 1 1 5-3874 (underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGG AA AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGCC TCTTGCAGCG 180 
CGGCGGCTTC GAAGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTGAAG TTTTTAAAAG 240 
CTGCTAAAGA CTCGGAGGAA GC A AGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 
TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TrGCAAAGAA GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGGAGAGA 600 
ACCCTCTOTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCG AGTGCGG 660 
AGCAG AGATC AAAAGATG AA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAA AAGCCGA AAT AAA AG AA AAA GATAATAACT CAGTTCTTAT TTGCACCTAC 780 
TTCAGTGGAC ACTGAATTTG G AAGGTGGAG GATTTTGTTT TTTTCTTTTA AGATCTGGGC 840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCAGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 
GGGAAGTAGG TGGAAGATTC AGCCAAGCTC AAGGATGGAA GTGCAGTTAG GGCTGGGAAG 1 140 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GCGCAGCACC 1260 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GC AG C AGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1380 
GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TGAGGAACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
ACCTCCGGAC GAGGATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 
GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG GTGTCCATGG GCCTGGGTGT 1860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTTTGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1980 
AGGTTCTCTG CTAGACGACA GCGCAGGCAA GAGCACTGAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
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AGCACTGGAC GAGGCAGCTG CGTACCAGAG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGGACCG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGGACCT 2340 
GGCGAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
5 TTCCTCATCC TGGCACACTC TCTTCACAGC CGAAGAAGGC CAGTTGTATG GACCGTGTGG 2460 

TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTGAG 2640 
CAGAGTGCCC TATCCCAGTC CCACTTGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 

1 0 CTACTCCGGA CCTTACGGGG ACATGCGTTT GGAGACTGCC AGGGACCATG TriTGCCCAT 2760 
TGACTATTAC TTTCCACCCC AGAAGACCTG CCTGATCTGT GGAGATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAAA TGATTGCACT ATT G ATA A AT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCGTC TTCGGAAATG TTATGAAGCA GGGATGACTC TGGGAGCCCG 3000 

1 5 GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120 
TCAGCCCATC TTTCTGAATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGGACA 3180 
CG ACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 

20 CGTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGGATGCTC TACTTCGCCC CTGATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCG GATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA G AGTTTGG AT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTITGATGA 3600 

25 ACTTCGAATG A ACT AC AT CA AGGAACTCGA TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACC AGCT C ACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCGAGAGAG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GGACTTTCCG GAAATGATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
G AAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TTTCCCCACC 3900 

3 0 CCAGCTCATG CCCCCTTTCA G ATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGAATTTCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCTTTTTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCCTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 
TGGTGTrGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 

3 5 TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACGTTTA CTTATCTTAT GCCACGGGAA 4260 

GTTTAGAGAG CTAAGATTAT CTGGGG AAAT CAAAACAAAA AACAAGCAAA C AAA AAAA A A 4320 
A 

Seq ID NO: 94 Protein sequence : 
40 Protein Accession #: NP 000035.1 

1 11 21 31 41 51 
I I I I I I 

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV REV1QNPGPR HPEAASAAPP GASLLLLQQQ 60 
45 QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTGYLVLDE EQQPSQPQSA 120 
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 1 80 
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LG VPPAVRPT PCAPLAECK.G SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
50 DYYNFPLALA GPPPPPPPPH PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420 

GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VVYYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLPID YYFPPQKTCL ICG DEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600 
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETTQKLT 660 
55 VSICEGYECQ PIFLNVLEAJ EPGVVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720 

ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYIKELDR 840 
IIACKRKNPT SCSRRFYQLT KLLDSVQPIA RELHQFTFDL LIKSHMVSVD FPEMMAEHS 900 
VQVPKJLSGK VKPIYFHTQ 

Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession #: NM 0024')7 

Coding sequence: 1 35-1472 (underlined sequences correspond to start and stop codons) 

65 I 11 21 3! 41 51 
I I I 1 I I 

GGCACXiAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 1 20 
GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATlGGCA 180 

7 0 CAGGCTCCTA CGGCCGCTGC CAG AAGATCC GG AGGAAG AG TGATGGCAAG ATATTAGTTT 240 
GGAAAGAACT TGACTATGGC TCC ATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TCGTGAACTG A A ACATCCA A AC ATCGTTCG TTACTATGAT CGGATTATTG 360 
ACCGGACC^\ TACAACAC7G TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGG AAC C AAGG AAAGGC A ATACTTAGA TGAAGAGTTT GTTCTTCG AG 480 

75 TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTTGGGCTA GCTAG AATAT TAAACCATGA C\CGAGTTTT GCAAAAACAT 660 

388 



WO 02/1159377 



PCT/US02/02242 



TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 
TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTGAT G AATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
5 ATTACCATCG ACCTTCTGTT G AAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 102O 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1 140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG A AG A ACT AC A 1200 

1 0 GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAG AACTT CTTAATCTTC 1 260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGA AA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCT AGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

1 5 AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCA AA TGCTGTAGTG TTGAATACTT 1560 
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

20 TATGCAGGAA GAGTAGCACT CACTG AATAG TTTTAAATGA CTG AGTGGTA TGCTTACAAT 1 860 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 

25 T AAA AG A ATT CTTCAGTTA 

Seq ID NO: 96 Protein sequence; 
Protein Accession #: NP 002488 

30 1 11 21 31 41 51 

ill!!! 

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP IENSMPSRAE 60 
DYEVLYTIGT GSYGRCQKIR RKSDGK1LVW K.ELDYGSMTE AEK.QMLVSEV NLLRELKHPN 120 
IVRYYDRJID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 1 80 

35 CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARJL NHDTSFAKTF VGTPYYMSPE 240 
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAFSQKELAG KJREGKFRRI PYRYSDELNE 300 
HTRMLNLKD YHRPSVEEIL ENPLIADLVA DEQRRNLERR GRQLGEPEKS QDSSPVLSEL 360 
KXKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASNPELLNLP SSVIKKKVHF SGESKENIMR SENSESQLTS KSKCK.DLKKR LHAAQLRAQA 480 

40 LSDIEKNYQL K.SRQILGMR 

Seq ID NO: 97 DNA sequence 

Nucleic Acid Accession #: NM_007050.2 

Coding sequence: 1 85-4576 (underlined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 

CCTCCCGCCT C AGTTCGCGC CGCGCCTCGG CTTGGAACGC AGG AGCGCCG GCTCCGGGAG 60 
CCCGAGCGGA GCC AGCCGCG CGCACAGCCA GCGGCCGCGC CGGCG ATGCG GGGCCACCCC 1 20 

50 GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGGAAGGG GCTGAGCTGC CCGCCGCCGC 1 80 
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTG ATG AGC ACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGG AGAAA CCAATGCTGG ACCAGGCAGT GCCCACAGGA TCTTTCATGA TGGTGAACAG 420 

55 CTCTGGGAGA GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTGA AGGAGAATGA 480 
CACCCACTGC ATCG ACTTCC ATTACTACTT CTCCAGCCGT GACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTGAAGG TGAATGGTGG CCCCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTGATATTTG AATCCGTCTC ATTGAAGGGT CATCCTGGCT ACATCGCCGT 720 

60 GGACGAGGTC CGGGTCCTTG CTCATCCATG CAGAAAAGCA CCTCATTTTC TGCGACTCCA 780 
AAACGTGGAG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTGCTG GTGGGAAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCA ACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTGAT GGTGGGTCTG GTGTGTCCAA 1020 

65 CTACGCGGAG CTG ATCGTGA AAGAGCCTCC CACGCCCATT GCTCCCCCAG AGCTGCTGGC 1080 
TGTGGGGGCC ACATACCTGT GGATCAAGCC AAATGCCAAC TC CATC ATCG GGGATGGCCC 1 140 
CATCATCCTG AAGG AAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AG ACCCACAT 1200 
AGTCGACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCACCAC 1320 

70 CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATCG TAGACATCAG 1380 
AGCCCGGCAG CTGACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG CGCCCCTTCA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGGAGC TGGTGGTGCA 1620 

75 GACTGAGGAA GACGTTCCAG G AGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CG AGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1800 
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GGGGAAAGTG TTCAAGCTCC GGAATGAAAC CCACCACCTC TTTGTGGGTC TGTACCCAGG 1 860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT GAGTACGACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTG AAA CCCGCTCAGT CCCGGGGAGC 2040 
5 TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100 
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CCAGCCTCGA 21 60 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAGA CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CAAAGCCAAT GGAGAGACCA AAATCAACTG 2340 

1 0 TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 

1 5 GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGGACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TC ATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
GATGAGCTAC CCCCGGGACC AGTTCCAACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 

20 AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTGACGGA TACCATCGAC CTCGGCACTA 3120 
CATTGCGACT CAAGGTCCGA TGCAGGAGAC TGTAAAGGAC TTTTGGAGAA TGATCTGGCA 3130 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA C.GGTGAAATG 3240 

2 5 TGTGCGATAC TGGCCAGATG ACACGG AGGT CTACGGAG AC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG (iAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTrGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 

3 0 TGACACCATG CTTGACATGG CCG AG AATGA AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 

3 5 GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGG AG A ATCCAGCAAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTG AATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 

40 GCCCATCCAG GTGGAGTTCG TCTCCGCAGA CATCGACGAG GACATCATCC ACAGAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTOCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 

45 GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACGAGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 

5 0 GGGGTCTCCT GTTGCCGTGG G AGATGCTGC TCC A AAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CC\ACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 

5 5 CCCCTGCA AC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACA AG AG AG GCCCCCCATC CTCACACCTA ACCTGCATGG 5 1 60 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTGACC G TCCA GCATT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTG ATA A A GGTGTTCTTT GCTTTTTTGT 5280 
. GTGGTCAGTG GGAGGGGGTG GAACTGC AGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 

60 AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCC TGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAACAAGATA AGATTTGATT ATTTTCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGGAG GGTGTA AAAT CTTAGTCTCT TATGTCAAAA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 

65 AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
A AAAGCiCCAT TGTGGAGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GGATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 

70 CCTTAGAACC CCCAGGAGAG AGCTACTGAC TTGTTATCAT CTCCATTTGA TCATCTCCTC 6000 
CAATGAGGAA ACCCACGCAC CTTCCTTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATIT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GGAAATTTAG 6180 
TACTTTCTTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 

75 CACATTCCCA GCTCCCCAGG AA AGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT GAATGG ACCA GCCAGGGAAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 
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TTGAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 64 SO 
TTTGGCCCTC AAGAGCCCAC TGTATGCTGG GGAAATGGAA CCATGGTGCA GTAGTGTGGA 6540 
GTCGATGAGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTCTTCGGCCTCG GGCTTCCTGG 6600 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA A A A AGG A A A A AAAGGAAACT 6660 
CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGGAGCAGC AAGAGATGAG 6730 
CCAGGGTCCA GGAGACAGAT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGGA GCTTTAGAGA GATGCACTCA GCTTCGTGTT GGCGAAGACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 
CTTTTTAATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 
GCAATGGATC ATGAGACTCA GATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGGAG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGCCGACAAA 7140 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CC A GACTTG A 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 
AAGGAACCCC ACGATGAACT TGCTCTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500 
AACACAAATC CTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 
AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAGAAAC 7620 
AAACTTGCAA AA AAA A AAA A TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTG AT 7680 
GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGGACA 7740 
GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATGAC 7800 
TTCCCTAAGA TCCCATGGGA CAAGGATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 
CCCAGGAATT TGAACATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTGTCA TGAGC\GAGG 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCTGTTTGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CCACCTGTGT 8100 
TTATGGCCTA A ATAGCTTC A GG ATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8 1 60' 
GGGTGAAGAA TCTGGGCCAA CCAGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 
TCTTTGTGGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
CCTGCAAAGG AACCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGGAGG GTTTGTGATT GTCACCATTT ACTGATGAGG AAACTA AGGC 8400 
TCCTAGGGGA GA AATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 
TAAACCGGTT TTTTCTCACT ACAGAGACAA TATTTTTCCA CCATTGTATC TCACATTTTT 8520 
CCCAGGAGGT TACCCATAAC AGAAGAGACT AGAGTGGAAC AGATACGTCA GTGGATAAAG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAA A ATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 
TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 
TATGACGGAC CCTAGGTCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAGCAG 8820 
TGACCCTGCT GCCTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TG ACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ATTTGCTCAT CTGGCGTGGA GAAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 
TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9120 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CCTAGCACCT TGGAGGGGGC ACTATTAGGA AATCGAGATT ACTATGGCAC AATTAATTCC 9300 
TGGGTAAGGC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 
ATACATGAAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGGAAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AGAACCGAGG ATGCAGATGA CTTATTTAGC AAAATGGTCA CTTCTGTGAC ATAGCTGGAG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GCAAAAGATG ATGCATTTCC CTCTGACCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 
CCACACTAGC TCTTGCTGTA AGTCCTCACC AGGATCTACA TTTCCTCGTC GCTGGTGGGA 9720 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT GCAGGAGTGA CTGCCCAACC 9840 
CAGATCCAGA GACATTGAAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATGAAGAAAG CCACCCTTCC TTAGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 
CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10O20 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTG ACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CTTCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 
AAATOCAACT AGAAGAGACA AGCTGAGATT CAGATTTGTT TACTCCTCCC ATGCAAAGTT 10320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGuTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
TGGATCTGAT CCTCCGAAGC ACAGCAG AG A TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
ACTCTTCTAG TTTTTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCACTGGGGC AGATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 
CTMTGGGAA GAGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 10920 
GCTCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 1 1040 
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TTAATGTAGT TCATGGTTCC TTAGTGAGCA GGACTCTTGG ATGTGGAGGA GAAAGGGTCA 1 11 00 
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 11 160 
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 11220 
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 1 1280 
5 GCCCC AGGGC AGTCTGG AGG CTTAGAGGGC AG AGTGGTGT C ATTCCATGT AAAGTTAGGC 1 1 340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 1 1400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 11460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC C AG AG AGG AT AAGCTGCCCA 1 1520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 1 15S0 

1 0 AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 1 1640 
1TGAACTCAT TCATACCTGT AGAGCACGAA TCCAAGTAGA TTGTGTTTGG TCTGTACAGG 1 1700 
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 11 760 
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 1 1820 
CTGCTTTGCC ATAGACCCAA TACCCATGTA G ATCTCTCAT GAG AGCAGCC ATAACTCAG A 1 1 880 

1 5 CCCACTG ACC AACAGGGCCA TGAGTGACAG CCAG AACCAG TGAAGGTCCA AGTAGGACAC 1 1 940 
AG AGC AGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 

20 CTAGCCA ATC AAAG AC AAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGG ACACCCT 1 2240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTTTTTTC 124RO 

25 TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATA ATGT CAGGTCAATG AATATCAGAT 1 2540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
AC A ATA A AGC TGTGACCCTG 

30 Sea ID NO: 98 Protein sequence: 

Protein Accession #: NP00898U 

1 11 21 31 41 51 

~- I 1 1 1 I t 

35 GIREFNPPRO TEINTYROSI NEPHOSPHAT ASERJECEPTO RTYPETHOMO SAPIENSMAS 60 

LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 1 80 
VKVNGGPQGN PVWNVSGVVT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC 1AGGKWSQHD KLWLQQWNGR DTALMVTRVV 300 

40 NHRRFSATVS VADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTPIA PPELLA VGAT 360 
YLWIKPNANS IIGDGPIILK EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV EIVDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQY VFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELVVQTEED 540 
VPGAVPLESI QGGPFEEKJY 1QWKPPNETN G V1TLYEINY KAVGSLDPSA DLSSQRGKVF 600 

45 ICLRNETHHLF VGLYPGTTYS FTTKASTAKG FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660 
DTTFTVMLICP AQSRGAP VSV YQLVVKEERL QKSRRAADII ECFSVPVSYR NASSLDSLHY 720 
FAAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAGVIAGL LMFDILLGV 840 
MLTIKRRRNA YSYSYYLSQR KLAKKQFCETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 

50 SSSQDVNGFT DGSRGELSQP TLTIQTIIPYR TCDPVEMSYP RDQFQLAIRV ADLLQHITQM 960 

KRGQGYGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI 1SYDHSRVRL LVLDGDPHSD 1020 
YIN ANY IDG Y HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTLIETEPL AEYVIRTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1 140 
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAIDTML DMAENEGWD IFNCVRELRA 1200 

55 QRVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NTSRLDPQTN SSQJKDEFQT 1260 
LNIVTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QPAAFVVTQH PLPNTVADFW RLVFDYNCSS VVMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED IIHRIFRJCN MARPQDGYRJ VQHLQYIGWP AYRDTPPSKR SLLKVVRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NIIDVFHIVK TLRNNKSNMV 1500 

60 ETLEQYKFVY EVALEYLSSF 

Scq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM 002988. 1 

Coding sequence: 71-340 (underlined sequences correspond to start and stop codons) 

65 

1 11 21 31 41 51 

; till' 

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCrCA CTCTGACCAC TTCTCTGCCT 60 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 

70 CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTGA A CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 

7 5 CC ACCCTGGA GGCCACCTCT TCTAAG AGTC CCATCTGCTA TGCCCAGCCA CATTA ACTAA 4*0 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 
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TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GOCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
AA AAA A A AAA AA A A A A A A A A AAA 

Seq ID NO: 100 Protein sequence: 
Protein Accession tt: NP_002979.1 

1 11 21 31 41 51 

I I I I i I 

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQ1P QKFIVDYSET SPQCPKPGVl 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 

Seq ID NO: 101 DN A sequence 

Nucleic Acid Accession #: NM_01 5507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and slop codons) 
1 11 21 31 41 51 

t i : i i 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCOCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGG\C CCGGTAACTG 1 20 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 
CTCAGTGGCC; ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGA AGACACA GA AGAAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTA AAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
ATAGTTTCC A GAGGCGGGAA CTCTCATGGA GGTA AAA A AG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTG AATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 
TGG AATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1 560 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCrTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 
TTTGTGAA AA ACAGTAACAA TGCCCTGGCA TGGG AGAAGA CCACGAGTGA GGATGAA AAG 1 740 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGG ATGACTGAATGTTACT ATCTTTATAT 1 920 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 
TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 
CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAG A AAA AAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATG ACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATfTA AATTCTTTGT AATAATAATA TCCAAATCAT C AAA AAA AAA AAAAAAAA 

Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_056322.2 

1 11 21 31 41 51 

! ! I I I 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT GSQDVNECGM KPRPCQHRCV NTHGSYKCFC 1 20 
LSGHMLMPDA TCVNSRTCAM rNCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKXLAHKNSM KXKAKIKNVT 300 
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEICREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHG1CD WKQDRED0FD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYR1 AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKJQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLS VDD 

Seq ID NO: 103 DNA sequence 
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Nucleic Acid Accession ft: NM_001 565.1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

5 I i i i f ] 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACCATGA ATCAA ACTGC G A TTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 1 20 
ATTCA AGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 1 80 
CCTGTTAATC CAAGGTCTTT AG AAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
1 0 CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
1 5 GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTITCTGC TTTGGGGTTT ATCAGAATTC 720 
TCAGAATCTC A A AT A ACTA A AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 
20 CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 



25 



Seq ID NO 104 Protein sequence: 
Protein Accession #: NP_001556. 1 



1 11 21 31 41 51 

30 i i i i I 

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60 
E1IATMKI1KG EFCRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 
35 Nucleic Acid Accession H; MM 015068 1 

Coding sequence: 1 170*2243 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

40 GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 

AACAACAACA ACA ACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 1 20 
ACCGAACGAA GAAGGGACGA GCTCTCTGAA GAGATCAACA ACTTAAGAGA GAAGGTCATG 180 
AAGCAGTCGG AGGAGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AG AGGAG A AC 240 
ACCACCCTTC GAGAGCAAGT GGAACCCACC CCTGAGGATG AGGATGATG A CATCGAGCTC 300 

45 CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAGA AGTTCGATGG CAACCCAC.AC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TGACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600 

50 CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCAATGCTTT CCAGATGATT GCCCAGGACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT GGGCAGTGCA TTCACATTGA GAGAAGGCTG 840 
GCCAGGGCTG CTGCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 

5 5 ATTGCA AGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 11 40 
ATGATGCCTC ATCTCCACAC TTGCAAGTG A TGC TCCAGAT TC ATCTTCCG GGCAGACACA 1 200 

60 CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TCGGGC;CCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 

65 CTATCGTCTT TGATTCTGAA TACTGCCGOT ACCACTGCCG GATGTATTCT CCAATACCAC 1 560 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAd 1620 
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGC \CG 16K0 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCACACAGTA 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 

70 TGGCAAGAAA TGTAA AAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1 860 
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTAC CCAGTAGGAT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 

7 5 AAGGA AGATC ACTATATGTA CCTGTGATGA TCACTTGGA A TCCACACTGG TACCGCCAGC 2 1 60 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
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TTTATTCCTG TTCAGCTTCT CAATCAGTGA CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGC^ CCAAAAGACC TTCAACATTT TCTGAGAGCA ACAGAGTATT 2460 
TGCCAATA AA TGATCTCTCA TTTTTCCACC TTGACTGCCA ATCTAACTAA AATAATTAAT 2520 
5 \AGTTTACTT TCCAGCCAGT CCTGG AAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTAAATTA TAGGCTGCCA AATTTGCTGT TTA ACATTTA CAGAGAAGCT G ATACAAACG 2640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG GAGGAGGACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTGATCCTCT CGCTGCACCC TTAGAAACTT 2820 
1 0 CACCGTCTTC AAACTCCATTTCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 

GGTTCTCCCA GGAGCAGGAA AAACCCTTGT GACATGAAAC ATCTCAGGCC TGAAAAGAAA 2940 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACMGTACCC AATGACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCM2AAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 

1 5 TTCATGAAAT GTGACCACCT CTCAATCACC TTTGAGGGCT AAAGAGTAGC ACATCAAAAG 3180 

GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATGAGTGT TCTTAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGC ATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
ATTAGGATGT GTGGAGAGAG GAGAAATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 
20 GAAGGGGG AC ATCTTTGAAG AAG AGGAATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTA ACCACT TCAAAAGGAT CAGAATCCAC 3540 
GTAATGAAAA AGGTCCCTCT AGAGGATGGA GCTG ATGTGA AGCTGCCAAT GGATGAAAAG 3600 
CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCAGCCC 3660 
AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGGACTGCC TGACTTCCTT 3720 

2 5 TAGCCTGGTC CCTTGCTACT ACCTTG AACT GTTTTATCTA ACCTCTCTTT TTCTGTTTAA 3780 

TTCTTTGCTA CTGCCATTGA CCCTGCTGCA GGATTTGTGT CATTTTCCTG CCTGG TTGCT 3840 
GAGACTCCAT TTTGCTGCCA CAC\CAGAGA TGTAAGAGGC AGGCTTTAAT TGCC\AAGCA 3900 
^GTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
TCTAACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATCCGC 4020 
30 TCTTATTCTT GCATGTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GC AC TATCC A GTTCTTATTG 4140 
TTA A ACT A AG TAAAAATG AT AAGTAACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TAC^ATGCTG TGAATGAGAG GCTCCTCAGA ACTGGAGCAT TTGTATAATA ATTCATCCTG 4260 
TTCATCTTCA ATTTTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 

3 5 CATATAAAAG GATATAAAAT TTG A AAAGAG AAACCTAATT GGCTATTT AA TCC AAAACAA 4380 

C I 1 1 1 1 1 1 1 1 TCCTTCAATG GAATCAG AAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 
ATTACTTTTA AAATGGTGCA TTTGTGCTTC TGAACTATTT TGAAGAGTCA CTTCTGTTTA 4500 
CCTCAAGTAT CAATTCATCC TCCATACATT TGAATTCAAG TTGTTTTTTG TCAAATTTAC 4560 
AGTTGTCAAT TGATCTTCAA GCTGCAGGGT GCCTAGAAAT GGGCCGTTGT CTGTAGCCCT 4620 

40 GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCA AAA GTCTGGAGAA GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGMA GTCCCTGATC 4740 
CACATTTGGC TGTTTACTAA AGCTTGTGAT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TGAAGTTTGG TTAAGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTAA 4920 

45 TCAAATTTAT GTTAATTGGA TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 

TTATTCATAA GAAAGGAAAA AAATCAATGG AATTTGATAT CTAAAGAAGT TAGAAAGGGA 5040 
GC*AAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTTTAGCCCAAAT TTACATTGTT AATTAAATCA 5160 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 

5 0 TTATCATGCT AGGATATTGA AACAC AGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGGACAG TG AGGTATTT GTCTTAGTGG AAA AAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GA4GTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGGAATTC TGTTTAAGTT TGTAGGTACC TCTTGGACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAGATACAG ACAGTTCCAA GATTGACAAC AGAGAACAAC 5520 

5 5 CTGCTGGAA A GACCTGGGCA GAAATGGAGA GCCCTGCGGG AACCATGCTA CATTTTCATC 5580 
TAAAGAGAGA ATGCACATCT GATGAGACTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT 5640 
GGTATTG \AT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTTTGTGTA ATCA AGTTTA 5700 
AGTAATAGGG GATATATAAT CAT AAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 

60 GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 

TAAACTCGGA GGGAACTGCA GGGAGACCAA CTTATTTAGA GCGAATTGGA CATGGATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTGAT TAGATTAATA ATTTAATAGA GGATGAGTGA 6000 
CCTCTGATAA ATTACTGCTA GAATGAACTT GTCAATGATG GATGGTAAAT TTTCATGGAA 6060 
GTTATAAAAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 

65 CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 6180 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TAAAAATTCA ATAAAGATTT 6240 
AGTGGTTAAG TGC 

Seq ID NO: 106 Protein sequence: 
70 Prolein Accession #: NP 055883.1 

1 11 21 31 41 51 

LtERRRDELS EErNKLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDIE 60 
75 LRG \AAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120 

VTSMMTGRA \ RWASAKLERS HYLMHNYPAF MMEMKHVFED PQPvREVAKRK IRRIRQGMGS 1 80 
VIDYSNAFQM IAQDLDWNEP ALJDQYHEGL SDHIQEELSH LEVAKSLSAL IGQCIMERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGGARMR LTQEEKERRR KXNLCLYCGT 300 
GGHYADNCPA KASKSSPAGN SPA PL 

Seq ID NO: 107 DNA sequence 

Nucleic Acid Accession #: NMJ)03 679.1 

Coding sequence: 47-1 50 7( underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

I \ I I I I 

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTT ATG G ACTCATCTGT 60 
CATTCAAAGG AAAAA AGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 1 20 
CnTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATGAA GCTAGGGAAG ATACTCGAGT 180 
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCAAG 300 
AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAGAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 4 SO 
GATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 
CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AGAACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AGATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1 140 
AGAGATGCGA GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200 
TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTCCAG 1260 
AATAAGATAC CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1500 
CAGG TGA TAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560 
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAGATAGT GTTCTGCTTA 1620 
TAATTAA ACT GAATGTAGAG TATCTCTGTA TGTTAATTGC A ATTACTGGT TGGGGGGTGC 1 680 
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATA AAA GTGCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAG ATGCC 1 800 
TCAGGGAAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAG G ACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920 
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA G AGATCAGGT CTCACTCTGT TGACCAGGCT GG AGTGCAGT 2220 
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA G AGACCGGGT CTTCCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 
TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT A A AG ATA AG A 2460 
AAA AGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCC ATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACT A A TTTAT TCTTCGACTA 2640 
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AG AAATCCTT CAAACATGAT 2760 
TAATTATGAA GATGAAACAC TAGAGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATG ATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGGAAAGAAA A AATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 
TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAA 1 I II TAG AGAAACAACA 3120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240 
TCTCTG AAAC TCCTCTTTGT CATACTGCAC ATATAAAACG TCTTTTGTTT CCAACAAGAG 3300 
GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360 
CTGAATAGAG TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCGACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 
ATTAA AAGAA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAGAGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TGAAAGTCCA AAAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC GAACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTGACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAGAAGAGTT 3900 
GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT AAAGTGGAAT 4080 
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VTTAAGTAAA AGTTGGGCAC TAATCTGG AT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140 
TAAGTTGTTT TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAG ATC 4200 
TACCTAAATC ACCCCAGAGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGGAAT 4260 
ATGTAGGACC ATTTCA ATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
5 GAGGGCAGGT ACAACTATTA AGAGATTTTG AACATTAAGT TAGTCC ACAA ATATTCAGTG 4380 
GGCATCTACT AGGTGACAGC CACTGTGCTA T AATTAGAGA CTTTTTACTA TAAGCATCAA 4440 
\A\CAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTG A AGAATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG A A AA A A AT AC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACA A 4620 

1 0 CATCGAATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 

GAACCTGGGA GGCCAACGCC TAAAGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT 474C 
\A ATGACTGC ATAGG ACAGA TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATGAC CAGTAAAACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 

1 5 ATTATTTAGA GCGAAATAAA TGTTGAATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TA AM AT ATA ATAAATAGCT 
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Seq ID NO; 108 Protein sequence: 
Protein Accession #: NP_003670. 1 

1 II 21 31 41 i 



MDSSV1QRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 
RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSA1PYG TKSQYILSVS RENLNKDLLT 120 

25 AAEKYPNVKM HFNHRLLKCN PEEGMITVLG SDKVPKDVTC DLIVGCDGAY STVRSHLMKK 180 
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLH1WPRNT FMMIALPNMN KSFTCTLFMP 240 
FEEFEKLLTS NDWDFFQKY FPDAIPLtGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300 
LLGDA \HAIV PFFGOGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRLR1 PDDHAISDLS 360 
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFIPLYTM VTFSR1RYHE AVQRWIIWQKK 420 

30 VINKGLFFLG SL1AISSTYL LIHYMSPRSF LCLRRPWNWI AHFRNTTCFP AKAVDSLEQ1 480 
SNLISR 



Seq ID NO: 109 DNA sequence 
Nucleic Acid Accession #: NM 006115.1 

3 5 Coding sequence: 236- 1 765 (underlined sequences correspond to start and stop codons) 



75 



1 11 21 31 41 51 



GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 



60 



40 CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAA ACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 1 80 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240 
ACGAAGGCGT TTGTGGGGTT CCATTCAG AG CCG ATA CATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 
45 TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420 
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TG GCCCTTCA CCTGCCTCCC 480 
TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 
TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
50 TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720 
TGGTTTGAGC ACAG AGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAG ACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
GAAAA ATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTGAAAA TGGTGC AGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960 
5 5 TACCTGG AAG CTACCCACCT TGGCG A A ATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1 020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATCGCCCAGTTCA CCTCTC AG TT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGG ATCAG TTGCTCAGGC ACGTGATGAA 1 200 
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TG ATGCATCT 1260 
60 GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT GATGAGTGTG GGATCACGG A TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 
CTTGCAGAGTCTCCTGCAGC ACCTCATCGG GCTGAGCAATCTGACCXACG TGCTGTATCC 1560 
65 TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGG A CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1 800 
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 
70 ACAAATGTTC AGTGTG AGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 
GTG ATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 
G VTTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAA AT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 



Seq ID NO: 1 10 Protein sequence: 
Protein A ccession «: NP_0061 06. 1 
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1 II 21 31 41 51 

I I I I I I 

MERRRLWGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 
5 FDGRHSQTLK AMVQAWPFTC LPLGVLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ 120 
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE A AQPMTKfCRK VDGLSTEAEQ PFIPVEVLVD 1 80 
LFLKEGACDE LFSYLIEKVK RKJCNVLRLCC KKLKIFAMPM QDIKMILKMV QLDSIEDLEV 240 
TCTWKLPTLA KFSPYLGQMI NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
1 0 LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLL ALLP SLSHCSQLTT LSFYGNSISI 420 

SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP 1LCPCFMPN 



1 5 Seq ID NO: 1 1 1 DNA sequence 

Nucleic Acid Accession #: NM_0038I5 

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 



CGCTGCCATG CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGA A 1 20 
GGCCCCGAGG GAGCCCTTGG AGCCCCAGGT CCTTCAGG AC G ATCTCCCAA TTAGCCTCAA 1 80 
AAAGGTGCTT CAGACCAGTC TGCCTG AGCC CCTGAGGATC AAGTTGGAGC TGGACGGTG A 240 

2 5 CAGTCATATC CTGGAGCTGC TACAG AATAG GG AGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAGAGGC TTGGTGGTCC TGACCCCAGA GAGAAGCTAT ACCCTGGAGC AGGGGCCTGG 480 
GG ACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 

30 CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAGAG ACCAAGACTG TGGAGTTGGT 660 
GATTGTGGCT GATCACTCGG ACiGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGGACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780 
ACTAGTGGGC CTGGAGGCCr GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840 

35 TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAG AACTCC ATCTGTTCTC CrGACTTCTC AGGAGGTGTG AACATGG ACC ACTCCACCAG 1 020 
CATCCTGGGA GTCGCCTCCT CC AT AGCCC A TGAGTTGGGC CACAGCCTGG GCCTGGACCA 10X0 
TGATTTGCCT GGG AATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1 1 40 

40 GGAGGCCTCC ACAGACTTCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1200 
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG G AAATATGTT TGTGG AGCCG GGCGAGCAGT GTGACTGTGG 1320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TG ATTCTTTG ACCTGCCAGC TGAGGCCAGG 1 380 
TGCACAGTGT GCATCTGACG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 

45 GCAGTGTCGT CCTACCAGAG GGG ATTGTG A CTTGCCTG AA TTCTGCCCAG GAG ACAGCTC 1 500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCG AGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTr GGGGACCTGG 1 620 
AGCCCAGCCC GCTGCGCCAC 1TTGCCTCCA G ACAGCTAAT ACTCGGGGAA ATGCTTTTGG 1 630 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1740 

5 0 TGGGCAGCTC CAGTGCC AGA CAGGTAGG AC CC AGCCTCTG CTGGGCTCCA TCCGGGATCT 1 800 
ACTCTGGGAG ACAATAGATG TGAATGGGAC TG AGCTG AAC TGCAGCTGGG TGCACCTGGA 1860 
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1 920 
CCTGGTGTGT ATAG ACCATC GATGCCAGCG TGTGG ATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 

5 5 CTGGGCACCC CCTG ACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTG A CCACAGGGCT 2 1 00 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTGATGCTT GGTGCCGGCT ACTGGTACCG 2 1 60 
TGCCCGCCTG CACCAGCGAC I CTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGG T CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TA AGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGG AAG CCACTGCCTG CCGACCCCCA 2340 

60 GGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CCCCTCCGAC AGTGTCCTCG CTCTACCTC T GA CCTCTCCG 2460 
G AGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GGCGGGCGTG CCCrCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGAOCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGG A 2640 

65 GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 



Seq ID NO: 1 12 Protein sequence: 
„ n Protein Accession H: NP 003806 2 

70 

I II 21 31 41 51 

I I I t I i 

MRLAI IAVAIX3 I IjGAGSPLPS WPLPN1GGTE EQQAESEK.AP REPLEPQVI^ DDLP1SLKXV 60 
LQTSI.PEPLR IK1.ELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRVVSE GHTLENCCYQ 120 
75 GRVRGYAGSW VSICTCSGI.R GLVVLTPERS YTLEQGPGDI. QGPPIISRIQ DLIILPGHTCA 180 
LSWRESVHTQ TPPEIIPLGQR HIRRRRDVVT ETKTVELVIV ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVAI.V GI.EAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
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10 



25 



30 



AQLVTGTSFS GPTV'GMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360 
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 

a^cgnwv Tpgeqcdcgfl ddcvdpccds ltcqlrpgaq casdgpccqn cqlrpsgwqc 80 

RPTRGDCDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
P AAP1 CLOT A NTRGNAFGSC GRKPSGSYVS CTPRDA1CGQ LQCQTGRTQP LLGSIRDLLW 600 
ETIDVNGTEL NCSWVIILDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECRSK 660 
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAICPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIFPLVVPS RPAPPPPTVS SLYL 

Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession #: NM_002416 

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons) 



15 1 11 21 31 41 

I I I I 



ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTATGAAGAAAAG TGGTGTTCTT 60 
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGG AACCCC AGTAGTGAGA 120 
A AGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGG ACTA TCCACCTACA ATCOTGAAA 1 80 
20 GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 
aSt^G^^CATGTCTAAACCCA GATTCAGCAG ATGTGAAGG^ ^ AOTA™^ 300 
AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA 

AAG AAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATMGAG 420 
ACCACTTCAC CAATAAGTAT TCTGTGTT AA AAATG TTCTA TTTTAATTAT ACCGCTATCA , 480 
TTCCAAAGG A GGATGGCATA TAATACAAAG GCTTATTA AT TTGACTAGAA AAmAAAAC 540 
ATTACTCTGA AATFGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA ^600 
TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 
SaIg^CCA? GATTTTAGCA AT ACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 
CTCACAACAG CTGCCTGGAA G AGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 
StCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG O^CAim 840 
TG AAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 
SaCaS CACACACAAT GTGTCTGAGA GATTCATGCT OATTG^ATT GGGTATCACC 960 
ACTGGAG ATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AjGCCATCTG A 10 2 0 
TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 10BO 
3 5 aXgTCAGCtc TTCTCC ATCC TACCAC A ATG CAGTGCCnT CrrCTCTCC A GTGCACCTGT 1 140 
C ATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 
A A AT AA ACC T TTTTGG AC AC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 
CAcItG^GTC aTcACTCAAT GGTTAACTAA TTCTTGGGTG ™TCCTAT aCTCCJACC 1380 
40 AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 440 
CTA ATAATAC TGTGGAACTA GGTTTT AATA ATTTTTTAAT TGATGTTGTT ATGGGCAGG A 1 500 

Sgca^cagacca^^^^ 1560 

CTaSS™ GT^5SnT ACTTATTATC TTCAGGACAC TCACTACAGG ° AOCAGGGAT 1 620 
a A TCP AACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1 680 
45 GcIcGTCGtI ™AC?TOCGGATArTCTG GACTGTTTTT AAAAAA TATA CAGTTTAOCG 1740 
A A AATCAT AT A ATCTTACA A TGAAAAGG AC TTTATAGATC AGCCAGTGAC CAACCTTTTC 800 
CCAACCATAC AAAAATTCCT TTTCCCG AAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 860 
^CTAaSaTCT AACAAGATAG CCACCGAGAT CCTTATCG AA ACTCATTTTA GGCAAATATG 920 
AG^TTATrG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAATTACCACACCA 1980 
50 TCTOCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA CAGGAAGCAO iCCA^TOGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2 00 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 
^^TmAATC^CVCK^C^CrQ CTGATGATTT AGAG^T CXGG^A 2^0 
TCCCACCCGA ACGTCTTATC TAATCATG AA ACTCCCTAGT TCCTTCATGT AACTTCCCTG i 2280 
55 IaaAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 
GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA WCGTTATAA 2400 
TCATrTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 
AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCnTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 



60 



Seq ID NO: 1 14 Protein sequence: 
Protein Accession #: NP_002407 



1 11 21 31 41 51 

65 MKKSGVIFII GIIIT.V1JGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60 

IEIIATIKNG VQTCLNPDSA DVKELDCKWE KQVSQK-KKQK NuKJsHQKKKV LKVRKSQRSR .-0 
QKKTT 

70 Seq ID NO: 115 DNA sequence 

Nucleic Acid Accession #: NM„003238.1 

Coding sequence: 182- 1426 (underlined sequences correspond lo start and stop codons) 
1 11 21 31 41 51 

75 CAAGCAGGAT ACGTTTTTCT GTTGGGC ATT G ACTA GATTG TTTGCAAAAG TTTCGCATCA 60 

AAAACAAACA ACAA^AACAA AAAACCAAAC AACTCTCCTT G ATCTATACT TTGAGAATTG 120 
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'ITGATTTCTT ITrnTT ATT CTGACTTTTA AAAACAACTT TITTTTCCAC TTriTTAAAA 1 80 
AATGCACTAC TGTGTGCrGA GCGCriTTCT GATCCTGCAT CTGGTCAOGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGITCATG CGCAAGAGGA TCGAGGCGAT 300 
CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
5 CGAGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 

GAAGGCGAGC CGGAGGGCGG CCGCCrGCGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGGAGGTT TAC A AAATAG AC ATGCCGCC CTTCTTOCCC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AGACCCTACT TCAGAATTGT TCGATTTGAC GTCTCAGCAA TGGAGAAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660 

1 0 AGTGCCrGAA C AACG GATING AGCTATATCA GATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAG AACTAG AAGCAAGATT TGCAGGTATT GATGGCACCT CCACATATAC 960 

1 5 CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGITATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1 140 
TCCACTTTAC A tTGATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGG AGTTCAG ACACTCAGCA 1260 

20 C AGCAGGGTC CTG AGCTTAT ATAATACCAT AAA TCCAGA A GCATCTGCTT CTCCTTGCTG 1 320 
CGTGTCCCAA GATTTAGAAC CTCTAACCA T TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380 
IGAACAGCTT 'ICTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440 
1GGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
1GTAACAAGA AAACATAAGA GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1560 

25 GGTACTAGTT C AG AC ACT IT GGAAGTTTGT GTI CTGTTTG TTAAAACFGG CATCTGACAC 1620 
AAAAAAAGTT GAAGGCC1TA TrCTACATrT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 

Seq ID NO 1 1 6 Protein sequence: 
30 Protein Accession U: NP_003229.1 

1 11 21 31 41 51 

I I ! I I I 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSKIXL TSPPEDYPEP 60 
35 EEVPPEV1SI YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
I FYRFYFR1 V RFDVSAM EKN ASNLVKAEFR VFRLQNPKAR VPEQRJEl YQ ILKSKDLTSP 180 
IQRYIDSKVV KTRAEGEWLS FDVTDAVHEW LliHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 
NKSEEL EARF A(3IDGTS TYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
K RALDAA YCF RNVQDTJCCl^ PLYIDFKRDL G WKWIHEPKG YNANFCAG AC PYLWSSDTQH 360 
40 SRVLSl.YNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKI EQLSNMIVKS CKCS 

Seq ID NO 1 17 DNA sequence 

Nucleic Acid Accession #: NM 000095.1 

Coding sequence: 26-2299 (underlined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 

I I 1 I I 1 

C AGCACCCAG CTCCCC GCCA CCGCCATGGT CCCCG ACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAuAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

50 GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT G AAAAACACG GTGATGGAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

55 CGCCCACCCC TGCTTCCCCG GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCIGCGA 480 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CITTCGCCAA 540 
GGCCAACAAG CAGGT1TGCA CGGACATCAA CGAG TGTGAG ACCGGGCAAC ATAACTGCGT 600 
C CCCAACTCC GTGTGCATCA ACACCCGGGG CTCCITCCAG TGCGGCCCGT GCCAGCCCGG 660 
CITCGTGGGC GACCAGGCG T CCGGCTGCC A GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

60 CTCGCCCAGC GAGTGC CACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 
GTGCGTGTGT CGCGTT(j(3CT GG<3CCGGCAA CGGGATCCTC TGTGGTCGCG AC^CTGACCT 840 
AGACGGCTTC CCGGAC GAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTGACI GTG CCCAACrCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 
C GATCCGGAT GCCG ACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1 020 

65 GAACCCAGAC CAGCGfAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 10SO 
G TCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCC3GGGCG ATGCGTGCGA 1 140 
C GACGACATC GACGGCGACC GG ATCCGCAA CCAGGCCGAC AACI GCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGAC AGTG ATGGCGATGG TATAGGGGAT GCCI'G TGACA ACTGTCCCCA 1260 
GAAGAGCAAC CCGGA TCAGG CGGATGTGG A (XACGACTTT GTGGGAGATG CTTGTGACAG 1320 

70 C G ATC AAG AC ( AGGATGGAG ACXjG ACATCA GG ACTCTCGG G ACAACTGTC CCACGGTGCC 1380 
7AACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 
C GACAATGAC GGAGTCCCFG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 
GGAGGACGCG GACAGGGAtX GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAGAC AAGATCGACG IGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

75 GGCCTTCCAG ACAGTCGTGC TGG ACCCGG A GGGTG ACGCG CAGATrGACC CCAACTGGGT 1 680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGAC1T CGAGGGCACG TTCCATGTGA ACACGGTCAC 1 800 
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GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCrCCAGCT TCTACGTGGT 1860 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGOCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGG AAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
5 AAACGTGOGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2 100 
GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGOCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAG AGG ACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATG ACAGC CACCCTCACC 2340 
10 GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 240O 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Seq ID NO: 1 1 8 Protein sequence: 
Protein Accession #: NPJXW086.1 



1 11 21 31 41 51 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

20 NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAP AKANKQVCTD 180 
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 
ADCVLERDGS RSCVCRVGW A GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKKDDQK 360 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

25 VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 4S0 
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKJDVC PENAEVTLTD FRAFQTVVLD 540 
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVG Y1RVRFY EGPELVADSN VVLDTTMRGG 720 

30 RLGVFCFSQE N1IWANLRYR CNDT1PEDY E THQLRQA 

Seq ID NO: 1 19 DNA sequence 

Nucleic Acid Accession #: NMJH421 1 

Coding sequence: 157-1479 (underlined sequences correspond to start and stop codons) 
1 1! 21 31 41 51 

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGG ACCCAGA AG ACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120 
40 CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACAJG A ACTACAGCCT CCACTTGGCC 1 80 
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGG AGTCA GTTCAACGTC 240 
G AGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 
AACAAATTTC TCAGGCCC AA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420 

45 CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCrGGAT 480 
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAG AAGTCC 540 
TTCCTCCATG AAGTCACTGT GGG AAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600 
CTGTATGCCC TCAGA ATCAC GACAACTGTT GCATGTAAC A TGGATCTGTC TAAATACCCC 660 
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720 

50 GAGTTCACCT GGCTGAGAGG GAACGACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780 
CAGTACACCA TAGAGCGGTA TTTCACCTFA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840 
TACACTAG AT TGGTCTTACA GTTTGAGCTT CGGAGGAATG TTCTGTATTT CATTTTGG AA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
G ATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGACG ACCG TGTTATCAAT GACCACACTG 1020 

55 ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCGA TGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1 140 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA C AAC A A AGG A AGTAGAAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1 260 
GCCAGCATTG AAATTTCCAG CG ACAACGTT G ACTACAGTG ACTTG ACAAT GAAAACCAGC 1320 

60 G ACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGGCAGGA TTGTTGATTA TTTCACAATT 1 380 
C AAAACCCCA GTAATGTTGA TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTG_AG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCA AAG A 1 620 

65 ATG AAGCTCC AACCATTGTT CTAAGCTGTG TAG AAGTCCT AGCATTATAG GATCTTGTAA 1 680 
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATC AAGG A CATTCCCATG GAGCCCAAGA 1740 
TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT AOCTCATATA 1800 
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA G AAGTTGTTT CTAAAGTAAC I860 
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AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTOTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGGAATG ATGC1TATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTG A TCCTTCCAGC TGGTCTGCTC 2640 
5 TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 
TTAATTTCTT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940 

1 0 ACCAAGGCTC T AAA AG ATGA TTTCCCTTCT GTAACTCCCT AGAGCCACAG GTTCTCATTC 3000 
CTTTTCCCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTGATCACCT GATTTTTTTA 3060 
ACAAAATATT TCTAACGGG A ATGGGTGGGA GTGCTGGTG A AAAGAGATGA AATGTGGTTG 3120 
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 31 80 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 

1 5 AATATACCAT ATTAGCTACC CACC 

Scq ID NO: 120 Protein sequence: 
Protein Accession #: NP_05 502 6. 1 

20 1 II 21 31 41 51 
I I I I I I 

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 
YIVESKKSFL HEVTVGNRU RLFSNGTVLY ALRITTTVAC NMDLSKYPMD TQTCKLQLES 1 80 
25 WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY T1ERYFTLVT RSQQETGNYT RLVLQFELRR 240 
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTC1GVT TVLSMTTLMI GSRTSLPNTN 300 
CF1KAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTK.EVEEVS ITMIINSSIS 360 
SFKRK1SFAS IE1SSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420 
LFPL1FMLAN VFYWAYYMYF 

30 

Scq ID NO: 121 DNA sequence 

Nucleic Acid Accession #: NM_001 854 

Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons) 

35 1 11 21 31 41 51 

I I I I 1 I 

AACCATCAAA TTTAGAAGAA AAAGCCCTTT GACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCG rGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGG A GATGG AGCCG TGGTCCTCTA 1 80 

40 GGTGGAAAAC G AAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 

45 CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTG A AGACCACACT GGAAAACCTG 600 
C CCC AG AAG A CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGG AAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 

50 GAACAAGGAT TTTGGATGAA GAAGTTTrTG AGGGGGACAT TCAGCAGTTT TTGATCACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCrCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 
TCGAA'I ATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGG AGGC AAACATCGTT G ATGATTTTC 1 080 

55 AAG A AT AC AA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1 140 
GG AC A A ATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AG AATATCTA ACGGGAGAGG 1200 
A H ATGATTC CCAGAGGAAA AATrCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 
GCAGGGATTC TG ATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATG AAGA ATTTGG TCCA GGTGTACCAG 1380 

60 CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCC I CCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1 680 

65 CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC I860 
AGGGTCCCCC TGG TCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 

70 OTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 

75 CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GG ACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTA ITG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
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CAGATGGTGT CAG AGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAG AT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATrGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
5 GAAGACAAGG TCCA AAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 
10 ACCCTGGGCA ACGTGGGGAG ACTGGATTTC AAGGCAAGAC CGGCCCrCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CrGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 
1 5 AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
G AGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGG AG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 
20 GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGG AGGTG 3660 
ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT G ATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3S40 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGrr 3900 
25 CAGTTGGTGG TGTTGGAG AA AAGGGTGAAC CTGGAG AAGC AGGAAACCCA GGGCCTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGG AA GCTGGTCCAC 4020 
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATG ATGGC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 
GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGG AG A TCCTGGTCAA CCGGGTCCTC 4200 
30 CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 
35 GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT G ATTGGTCCT CCAGGAGAAC 4560 
AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 46S0 
GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 
40 CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTG A 4920 
AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 
45 CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 
AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGG AAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 
GTTATGACAA AGCACTTCGC TTCCTGOGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 
5 0 ATCCTTTTAT C A A A ACACTG TATG ATGGTT GT ACGTCCAG A AAAGGCTAT G AAAAAACJG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTG A TCAGAATCAG AAGTTCGG AT TTGAAGTTGG TCCTGTTTGT TTTCTTGGCT 5580 
AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 
55 TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AGCTTTGAAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 
TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GT AG AAA AC A A AAAAAGAAA 5880 
AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 
60 CCAAAACTTG CACGTGTCCC TG AATTCCGC TGACTCTAAT TTATG AGG AT GCCGAACTCT 6060 
G ATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT T AATGATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 1 22 Protein sequence: 
65 protein Accession #: NPJ)01845 



1 11 21 31 41 51 

70 llEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180 
DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

75 NIVDDFQEYK YGTMESYQTE APRHVSGTNE PNPVEE1FTE EYLTGEDYDS QRKNSEDTLY 360 
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNfiE FGPGVPAETD ITETSrNGHG 420 
AYGEKGQKGE PAWEPGMLV EGPPGPAGPA G1MGPPG1.QG PTGPPGDPGD RGPPGRPGLP 480 
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GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PG EKGPQGKP 720 
5 GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPG ERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGWG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 

1 0 GPAGLRGFPG ERGLPG AQG A PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1 080 
GPPGPAGEKG APGEKGPQGP AGRDG VQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1140 
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 

1 5 GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG fCRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEG LR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 
GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAK.GDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEE1F GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 

20 EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 

LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEK.TVIEI NTPKIDQVPI VDVM1SDFGD QNQKFGFEVG 1800 
PVCFLG 

25 Scq ID NO 123 DNA sequence 

Nucleic Acid Accession #: NM_0 15886 

Coding sequence: 485-1261 (underlined sequences correspond to start and stop codons) 



GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTG AAA A TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 1 80 
ATGGTTC AC A TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 

3 5 GTTCCTCTTA GGGGTCTGCC TCATG ACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA AC ATTTGTTT GG ATTTTTG A AGAAA GTCAA ATCAAGCAAT 360 
GCTCCCAAAT GATGTCTTTG TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 

40 AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAG A TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAGAATG ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAAA ATCTTGCAAA 780 
ATCGGCAGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT AC'ITACTGAG 840 

45 ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCC AGTTGGT 900 
CAAGCCATGG TATGATGAAG TGAAAGATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAG ATGTT TTGG TCCCAT G TGCACACAT TATACGCAGA TGGTTTGGGC 1 020 
CACTTCCAAT CGG AT AGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1 080 
TGTGTGGCGA CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1 140 

50 AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTG TG'IT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAAJA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATATAT 1320 
AT ATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 

55 AGTATTCCTT TGTATAAA1T AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGGAATTC 

Seq ID NO: 124 Protein sequence: 
Protein Acces sio n #: NP_0 5 69 7 0 . 1 

1 II 21 31 41 51 

60 | | | I I | 

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTDI EAALKAQI.DS AD1PKARRKR 60 
YISQNDMIA1 LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATQW DHGPSYIXRF 120 
LGQNl-SVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 180 
SNRiGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGrWIGE APYKVGVPCS SCPPSYGGSC 240 

65 TDNLCFPGVT SNYLYWFK 



Scq ID NO: 125 DNA sequence 
Nucleic Acid Accession #: NM 001793 

70 C oding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I 1 ( I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCAOCCC TCTCTCTGCA GCCATGGGGC 60 
75 TCCC1 CGTGG ACCTCTCGCG TCICI CCrcC ITCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCCGGGCG GTCITCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
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15 



20 



CAGCTCTGTT TAGCACTGAT AATG ATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 
AAAGAAGGTC ACTGAAGGAA AGG AATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA G AG AG ATTGG GTGGTTGCTC CAATATCTGT CCCTG AAAAT ^ 
CCTTCCCCCA GAGACTGA AT CAGCTCAAGT CTA ATAAAGA TAGAGACACC AAGATTTTCT 480 
5 ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAG AGAAGG 540 
AGA^GGC^ GTTGTTGTTG AATAAGCCAC TGGACCGGGA OOAGATTOCC AAaTATGAGC 00 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTOGA GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CG ACCAG AATGACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTG ACAGCC ACAGATGAGG 7S0 
10 ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AG^ACCCACACGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TG^ATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT G ATGCCAATG 1020 
ACA ATGCTCC CATGTT TG AC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 
GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 
GTGCCACCTA CCTT VT C ATG GGCGGTGACG ACGGGG ACC A nTTACCATC ACCACCCACC 1200 

Sgagagcaa ccagggcatc ctg acaacca ggaagggttt ggattttg ag gccaaaaacc 1260 

AGCACACCCTGTACGTrGAAGTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAG^CACCATAGTGGTCCACGTGGAGG ATGTGAATGAGGCACCTGTGTTTGTCCCAC .380 
CCTTCAAAGTCGTrGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TOTGTCTACA .440 

SSag^aga ccctgacaag gagaatcaaa agatcagcta ccgcatcctg agagacccag isoo 

STCGCT AGCCATGG AC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC ,560 

gtgaggvtga gcag ittgtg agg a acaaca tctatgaagt catggtotg gccatggaca i 620 

AtcSaAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATr G ATGTCAACG 1680 

25 accatcgScagtccctgag ccccgtcaga tcaccatctg iwo 

ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTfXrCC^T^ 3AAG AAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
I^TCG^AA C^AGAGCAG CTG ACGGTG A TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 

30 aSaAAC CTG^CCCTGGA CCCTGGAAAG GAGGTTrCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCC^CTCT GCTG VrCCTC CTGCTGGTGC TGCTTTTGTr GGTGAGAAAG AAGCGGAAGA 2.00 
SSc fSnCCTACTC CCAGAAG ATG ACACCCGTGA CA™T^A^ «« 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGOC GG AGGTGGTT CTCCGCAATG ACGTGGCACC ™ 

35 TGTACCGTCC TAGGCCAGCC AACCCAGATG AAA TCGGCAA CTTTATAATT GAGAACCTGA 2340 
A^G^^CI^A CACAGACCCC ACAGCCCCGC CCTACG ACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGC AGCGG CTCCG ACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT C AAG AAGCTG GCAGACATGT 2520 
ACGGTGGCGG GG AGG ACG AC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

40 Sagcat ctccaagggg fcTcAGncc <x™c^W£^Mg^2™ 

GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT -CtaMXmM ^™J™CT^™° 
TCCTTAGCCT TTCAGGATGG AGG AATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

c Sgg?cYgggttgcct cagaggccaa onra^GC^^TO^/^ 2no 

TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 
CTGGAATGG A ACCTTCTT AG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 
T^AAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTT AT T TTTT ATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 
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70 



Seq ID NO: 126 Protein sequence: 
Protein Accession #: N?_00 1 784 



1 11 21 31 41 51 

55 llGLPRGPLAS LIXLQVCWLQ CAASEPCRAV FREAEVTLEAGGAEQEPGQA LGKVFMGCPG 60 
OEPALf ^STDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 20 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN ISlIWDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
60 DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF T1HRSTGTIS VISSGLDREK VPEYTLTIQA 300 

SdGDGSTT TAV AWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAWSP 360 
A^ATYUMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVUCL 420 
PTTTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRiLR 480 
DP AG WLAM DP DSGQVTAVGT LDREDEQFVR NNTYEVMVLA MDNGSPPTTG TGTLLLTLID 540 
65 VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
wSxKFl- K.QDTYD VH LS LSDHGNKEQL TVLRATVCDC HGHVETCPGP WKGGFILPVL 660 
C^VLALL* LI. LVLULVRKK RK1KEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQI HR 720 
GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFUE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKJCLA DMYGGGEDD 



Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #. KM 003256.1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop coders) 



75 1 II 21 31 41 51 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
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JGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCG'ITGC 120 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 180 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAG ATAAAGATGT 300 
5 TCAAAGGGTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTGGTGT G AAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA G AGGGAA AGT CTGAATCATC ACTACCATCT GA ACTGTGGC TGCCAA ATCA 540 
CCACCTGCrA CACAG TACCC TG T ACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 

1 0 GGCTGTTGGA ACGAAAGCTC TATGG TTACC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
ITGACGGCAC CTGCAGCTGG TACCGGGGCC ACCTGCCTCT CAGGAAGGAG TT TG TTGACA 720 
TCGTICAGCC C TAG TAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTITGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 

1 5 TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCGATC TGGTAGGGGA GAAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1 140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AA AAA A AAA 

20 

Scq ID NO 128 Protein sequence: 
Protein Acteision H: NP_003247. 1 

I II 21 31 41 51 

25 i | i i i i 

MPGSPRPAPS WVU.LRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALVIRA KISSEKVVPA 60 
SA DP A DTE KM I.RYEIKQIKM FKGFEKVKDV QYIYTPFDSS IjCGVKLEANS QKQYLLTGQV 120 
LSDGKVFIHL CNYIEPWEDI. SLVQRESLNH HYHLNCGCQI TTCYTVPCTI SAPNECLWTD 1 80 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

Seq ID NO 129 DNA sequence 

Nucleic Acid Accession #: NM_007207.2 

Coding sequence: 143-1591 (underlined sequences correspond to start and stop codons) 
35 1 11 21 31 41 51 

I I I i I I 

CCACGCG'ICC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGGAGCGCCA GATGGTGG AG G AATACAC TT ATTTATGAAA CTGTCTTGAG TTCTrCTTGA 1 20 
ATTGCCAGTT TTCAGCCTCC TCATGCCTCC GTCTCCTTTA GACGACAGGG TAGTAGTGGC 1 80 

40 ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTG ACGT ATATG CCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 

45 TGCTAACCAG ATGGTCAACA ATAATGAGAA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
C ATCATTG AC TGCAGGCCCT TCATGGAGTA CAACAAG AGT CACATCCAAG GAGCTGTCCA 720 
CATTAACTGT GCCGATAAGA TCAGCCGGCG GAGACTGCAG CAGGGCAAGA TCACTGTCCT 780 

50 AGACTTCATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATGAGAATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTG A AGAGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 

55 C ACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCC1TCCTGT TCCTTGGCAA 1 140 
TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT ACAAGCGGCT 1 260 
GCCAGCC ACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAG AGG CTTTTGAGTT 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 

60 CCGCTCCGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGGTTGTG TG A CAATGGTCr GGATGGA AAG GATTGCrGCT 1620 
CTCCATTAGG AGACAATGAG GAAGGAGGAT GGATTCTGGT TTTTTTTCTT TCTTTTTT1T 1680 

65 TrGTAGTTGG GAGTAAAGTT TGTGAATGGA AACAAACTTG GTTAAACACT TTATTTTTAA 1740 
CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1800 
ATTAAGGAGG TTAAAGAAGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATACAACT 1 860 
TGGTTTCTCC CCCTTrTTCC TTTAAAGCTA NTTTGTAAAA GTTI'ATGAG 

70 Seq ID NO 130 Protein sequence: 

Protein Act fusion #: NP 0O9138 1 

1 11 21 31 41 51 

75 MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SIIPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQA1AAG TTTTAIGTST TCPANQMVNN 120 
NENTGSLSPS SGVGSPVSGT PKQLASIKII YPNDLAKKMT KCSKSHLPSQ GPVIIDCRPF 180 
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55 



75 



MEYNKSHIQG AVHINCADKI SRRRLQQGKI TVLDL1SCREGKDSFKRIFS KEI1VYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGK.EPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLN1GY VINVTTHLPL 360 
YHYEKGLRs'Y KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA GV SRSATIVl 420 
5 AYLMKHTRMT MTDAYKFVKG KRPIISPNLN FMGQLLEFEE DLNNGVTPRi LTPKLMGVET 4h0 
W 

SeqIDN0:131 DN A sequence 
Nucleic Acid Accession tf: NM_0O5409.3 
1 0 Coding sequence: 94-37S (underlined sequences correspond to start and slop codons) 

1 11 21 31 41 51 

TTCCTTTCAT GTTCAGCATT TCT ACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

1 5 caacagcacc agcagcaaca gcaaaaaaca aacajgagtg tcaagggcat ggctatagcc 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC180 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATrGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA ™[^^^^^L ™ 
AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA (K3CTTATAAT CAAAAAAGTT 360 

20 GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAACTC ^^^^^ ™ 
AACCTAGAAC AAGTTTAACT GTG ACTACTG ^^^^^^J^^ 0 ^ 
AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTG TACAG TTATGTGAAG G ATGAAAGGT 540 

GG ACCAAAAACA G AAATACAG T CTTCCTGAAT ^^^^^^^f^^J^SSj ™ 
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT ^GAAA^GGCT 660 

25 GCHTACCATCGGAGmACAAAGTCCrmACGT^ ™ 
GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC ™ 
GAGAACATTT CTGTCTCTAG AAGTTATCrG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTnTCAT GCAAATACAC ACTTCTTTCC 960 

30 ccIaatatcatgtagcacatcaatatg 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT \°™ 
TGGGATACTG GC AACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCITAATTT 1140 
GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGG ATAT GTGTG nTAC 1 200 
TGTACTTTTT GTTTTG ATCC GTTTGTATAA ATGATAGCAA T ATCTTG ^^^^^^FJ^^^fLl? ^ 0 0 
35 TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTG AAAAA TAAGCAAATG TATACCTAGC 1320 

AATCAcrrrr actttttgta attctgtctc ttag aaaaat acataatcta atcaatttct i 3bo 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 

gtcatttttt tctctaataa actaccacaa ccTrTcrnr ttaaaaaaaa aaa 

40 Tn Nn ' V Protein sequence: 

Protein Accession #: NP_0O54O0. 1 

1 11 21 31 41 51 

45 MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARL1IKKVE RKNF 

Seq ID NO 133 DNA sequence 
Nucleic Acid Accession U\ NM_012342 
50 Coding sequence: 373-1 155 (underlined sequences correspond to start and stop codons) 



] 11 21 31 41 51 

I I 1 I I I 



CTGGCGCGGG CGGG AGCTGC GGCGGATACC CTTGCGTGCT GTGGAGACCC TACTCTCTTC 60 
SSgA^TcG GCCGCTAGCG GGG ACTGAAG GCCGGGAGCX CACTCCCGAC CCGGGGCTAG 120 
CGTGCGTCCC TAGAGTCGAG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGG A 180 
AACTTTTCTG GGCTCCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACCCAGCCC CGCCGCTGAC GGAGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGGGGGTCG TAGCTGCCGC CG AGCCGGGG CTCCGG AAGCOGG CGGGGG C G^CGCGG CCG 3 60 
60 TGCGGGGCGT CAATGGATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT G AAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
GATCCTCAG A ACTCAAATTC CCCACTCACC CATGGCTGCC TGGACTCTCT TGCAAGCACG 600 
ACAGACATCT GCCAAGCCAA ACAGGCCCGA AACCACTCTG GCA CC ACC AT ACCC ACATTG 660 
65 GAATGCTGTC ATGAAGACAT GTGCAATTAC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 
^GGGGTGAGG CCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAGCAG AAACCT TATC 780 
ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC Go I CATTGCC 84u 
GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTG GCCCT G AGGATGCTT 900 
CGAAGTG AAA ATAAG AGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 
70 ScmcACC Tgacaccattc caaaaagggg CAGGTTGCAA AGTTAGACTT GGAATGCATG 1020 
GTGCCGGTCA GTGGGCACGA GAACTGCTGT CTGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 
CTCAGCAACG ATAAG ATCCT CTCGCTTGTT CACTGGGGCA TGTACAGTGG GCACGGGAAG 1 140 
CTGGAATTCG TATGACGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTtMOTTCTG CTGGACAGG A GCACTTTATC TGAAGACAAA CTCATTTAAT » AG 1260 
AGACAAAATG ACCTCTGCAA ACAGAATCTT GG ATA TTTCT TCTG AAG GAT TATTTGCACA 1320 
SZtA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA A AG AG AAG AC 1380 
TTTGTACACA CTGTCACCAG GGTTATTTGC ATCCA AGGGA GCTGG AATTG AGTACCTAAA 1440 
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TAAACAAAAA TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT A TTTTGTCTG A 

Seq ID NO: 134 Protein sequence: 
5 Protein Accession U: NPJB6474.1 

I 11 21 31 41 51 

I I I I t I 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
1 0 SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLECCH EDMCNYRGLH DVLSPPRGEA 120 
SGQGNRYQHD GSRNLITKVQ ELTSSKELWF RAAVIA VPIA GGULVLLIM LALRMLRSEN 1 80 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCI/FCD KMRQADLSND 240 
KILSLVIIWGM YSGHGKLEFV 

15 Seq ID NO: 1 35 DN A sequence 

Nucleic Acid Accession it: NM_0O1627.1 

Coding sequence: 64- 1 8 1 5 (underlined sequences correspond to start and slop codons) 



CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGGATGGTAT ACTGT AAATT CAGCATATGG AGATACCATT 180 
ATCATACCTT GCCG ACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 

2 5 AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAG AA AAGTGTGCAG 300 

TACGACGATG TACCAG AATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AG ATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTG AAATT 480 
GTAAGCAAAG CACTGTTTCT CGAAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 
30 GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATA ITr ACTATCCTAC AGAGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 

3 5 ATCAAAG AAG GGG ATA AC AT C ACTCTTAAA TGCTTAGGGA ATGGC AACCC TCCCCCAG AG 900 

GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GTTCCCTG AT AGACAAAAAA 1020 
AGCATGATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAGACAGAT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1 140 

40 AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAA A 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGG ACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAAC A 1 440 

45 GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1 500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAA A 1 740 

5 0 CATGTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAG A AAACAATCAC 1 800 
AAAACTGAAG CCJAAG AG AG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC 1860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AGA ATTATCT 1 980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATGATA 2040 

5 5 ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2 100 
CTTTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 21 60 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTGAT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTA AAACC CAA AGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 

60 CCCTAATTAA AAACTGTGAT TTTTATCACA AGGGAGGGGA GGCCGAGAGT CAGACTGATA 2400 
GACACCATAG GAGCCGACTC TTTGATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

65 Seq ID NO: 136 Protein sequence: 

Protein Accession #: NP 001618.1 

] 11 21 31 41 51 

70 MESKGASSCRLLFCLLISAT VFRPGLGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKK SVQY DDVPEYKDRL NLSENYTLSI SNARISDEKR FVCMLVTEDN 120 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLG DCISE DSYPDGNITW YRNGKVLHPL 180 
EGAWIIFKX EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTIHSEQAVF 240 
DIYYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 

75 MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 

NATWWMKDN IRLRSSPSFS SLHYQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQIKM 420 
TKKTDPSGLS KTIICHVEGF PKPAIQWTlT GSGSVINQTE ESPYINGRYY SKIIISPEEN 480 
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VTLTCTAENQ LERTVNSLNV SAISIPEHDE ADEJSDENRE KVNDQ.MCLiV G1VVGLXLAA 
LVAGVVYWLY MKKSKTASKH VNKDLGNMEE NKJCLEEKNHK TEA 



5 Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XMJ030559 

Coding sequence: 1-1 19 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGA ATT TCGTCGGTTT TCGCTGG AAA GATCAAAACC TGG AAAATTT 1 20 
GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA 1 30 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 

1 5 GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AG AAGCAGAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATTG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 
CCTGTGTCTT CTATTATAG A CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 430 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 

20 GTAACACCAC ATGGCTTAGA AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
GGTCTGGCTC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTAACAGACA TGATGATTGC AAATAGCCGT 720 
AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACAGTCGG 780 
ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCT A CCC ACAGCAGATT 840 

25 GAACCAAGCT TTGAGCCAGA GG ATG A AG AC AGCGAAG AAG ATGACATTAT CATTG AAG AC 900 
AATGGAGTGC CACAGC AGAT TCCA A AAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 
ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATGAAGTG 1020 
AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 

30 

Seq ID NO: 138 Protein sequence; 
Protein Accession #: XP_030559 



1 II 21 31 41 51 

MNRSHRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGIXQHV HKTPNVDVLV 60 
GYADIHGDLL PINNDDNYHK AVSTANPLLR IFIQKJCEEAD YSAFGTDTLI KKKNVLTNVL 120 
RPDNHRKKPH IVISMPQDFR PVSS1IDVDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 180 
VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVLEVNG IEVSGKSLDQ VTDMMIANSR 240 
40 NLIITVRPAN QRHNVVRNSRTSGSSGQSTD NSLLGYPQQI EPSFEPEDED SEEDDIIIED 300 
NGVPQQIPKA VPNTESLESL TQIELSFESG ONGFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGTH TL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 1 . A method of detecting a breast cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-25. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3. The method of claim 2, wherein the nucleic acids are mRNA. 

1 4. The method of claim 2, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 5. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-25. 

1 6. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 7. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat breast cancer. 

1 8. The method of claim 1, wherein the patient is suspected of having 

2 breast cancer. 

1 9. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1-25. 

1 1 0. The nucleic acid molecule of claim 9, which is labeled. 

1 1 1 . An expression vector comprising the nucleic acid of claim 9. 

1 12. A host cell comprising the expression vector of claim 1 1 . 
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1 13. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-25. 

1 14. An antibody that specifically binds a polypeptide of claim 13. 

1 15. The antibody of claim 14, further conjugated to an effector component. 

1 16. The antibody of claim 15, wherein the effector component is a 

2 fluorescent label. 

1 17. The antibody of claim 15, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 18. The antibody of claim 15, which is an antibody fragment. 

1 19. The antibody of claim 15, which is a humanized antibody 

1 20. A method of detecting a breast cancer cell in a biological sample from 

2 a patient, the method comprising contacting the biological sample with an antibody of claim 

3 14. 

1 21 . The method of claim 20, wherein the antibody is further conjugated to 

2 an effector component. 

1 22. The method of claim 21, wherein the effector component is a 

2 fluorescent label. 

1 23. A method for identifying a compound that modulates a breast cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a breast cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-25; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 
1 24. A drug screening assay comprising the steps of 
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2 (i) administering a test compound to a mammal having breast cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of breast cancer. 
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